FIELD OF THE INVENTION

The present invention is related to SHB-GAS-102, SHB-GAS-103,
and SHB-GAS-104 polypeptides of S. pyogenes (Group A
Streptococcus) and corresponding DNA fragments, which may be
used to prevent, diagnose and/or treat S. pyogenes infections.

BACKGROUND OF THE INVENTION 

Streptococci are gram (+) bacteria which are differentiated by 
group specific carbohydrate antigens A through O which are found 
at the cell surface. S. pyogenes isolates are further
distinguished by type-specific M protein antigens. M proteins
are important virulence factors which are highly variable both
in molecular weights and in sequences. Indeed, more than 100 M
protein types have been identified on the basis of antigenic
differences.

S. pyogenes is responsible for many diverse infection types,
including pharyngitis, erysipelas and impetigo, scarlet fever,
and invasive diseases such as bacteremia and necrotizing
fasciitis. A resurgence of invasive disease in recent years has
been documented in many countries, including those in North
America and Europe. Although the organism is sensitive to
antibiotics, the high attack rate and rapid onset of sepsis
result in high morbidity and mortality.

To develop a vaccine that will protect individuals from
S. pyogenes infection, efforts have focused on virulence factors
such as the type-specific M proteins. However, the
carboxy-terminal portion of M proteins was found to induce
cross-reactive antibodies which reacted with human myocardium,
tropomyosin, myosin, and vimentin, which might be implicated in
autoimmune diseases. Others have used recombinant techniques to
produce complex hybrid proteins containing amino-terminal
peptides of M proteins from different serotypes. However, a safe
vaccine containing all S. pyogenes serotypes will be highly
complex to produce and standardize.

In addition to the serotype-specific antigens, other S. pyogenes
proteins have generated interest as potential vaccine
candidates. The C5a peptidase, which is expressed by at least
40 S. pyogenes serotypes, was shown to be immunogenic in mice, but
its capacity to reduce the level of nasopharyngeal colonization
was limited. Other investigators have also focused on the
streptococcal pyrogenic exotoxins, which appear to play an
important role in pathogenesis of infection. Immunization with
these proteins prevented the deadly symptoms of toxic shock, but
did not prevent colonization.

Therefore there remains an unmet need for S. pyogenes antigens
that may be used as vaccine components for the prophylaxis
and/or therapy of S. pyogenes infection.

SUMMARY OF THE INVENTION 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at least 
70% identity to a second polypeptide comprising SEQ ID NO: 2,
4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44 or 
fragments or analogs thereof. 

According to one aspect, the present invention relates to 
polypeptides comprising SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30,
32, 34, 36, 38, 40, 42, 44 or fragments or analogs thereof.

In other aspects, there are provided polypeptides encoded by
polynucleotides of the invention, pharmaceutical compositions,
vectors comprising polynucleotides of the invention operably
linked to an expression control region, as well as host cells
transfected with said vectors and processes for producing
polypeptides comprising culturing said host cells under
conditions suitable for expression.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 represents the DNA sequence of the SHB-GAS-102 gene from
serotype M1 S. pyogenes strain ATCC700294; SEQ ID NO: 1.

Figure 2 represents the amino acid sequence of the SHB-GAS-102
polypeptide from serotype M1 S. pyogenes strain ATCC700294; SEQ
ID NO: 2.

Figure 3 represents the DNA sequence of the SHB-GAS-103 gene from
serotype M1 S. pyogenes strain ATCC700294; SEQ ID NO: 3. The
underlined portion of the sequence represents the region coding
for the leader peptide.

Figure 4 represents the amino acid sequence of the SHB-GAS-103
polypeptide from serotype M1 S. pyogenes strain ATCC700294; SEQ
ID NO: 4. The underlined sequence represents the 27 amino acid
residue leader peptide.

Figure 5 represents the DNA sequence of the SHB-GAS-104 gene from
serotype M1 S. pyogenes strain ATCC700294; SEQ ID NO: 5. The
underlined sequence represents the region coding for the leader
peptide.

Figure 6 represents the amino acid sequence of the SHB-GAS-104
polypeptide from serotype M1 S. pyogenes strain ATCC700294; SEQ
ID NO: 6. The underlined sequence represents the 19 amino acid
residue leader peptide.

Figure 7 represents the DNA sequence of the SHB-GAS-102 gene from
serotype M3 S. pyogenes strain MGAS315; SEQ ID NO: 21.

Figure 8 represents the amino acid sequence of the SHB-GAS-102
protein from M3 S. pyogenes strain MGAS315; SEQ ID NO: 22.

Figure 9 represents the DNA sequence of the SHB-GAS-102 gene from
serotype M3 S. pyogenes strain SSI-1; SEQ ID NO: 23.

Figure 10 represents the amino acid sequence of the SHB-GAS-102
protein from M3 S. pyogenes strain SSI-1; SEQ ID NO: 24.

Figure 11 represents the DNA sequence of the SHB-GAS-102 gene from
serotype M5 S. pyogenes strain Manfredo; SEQ ID NO: 25.

Figure 12 represents the amino acid sequence of the SHB-GAS-102
protein from M5 S. pyogenes strain Manfredo; SEQ ID NO: 26.

Figure 13 represents the DNA sequence of the SHB-GAS-102 gene from
serotype M18 S. pyogenes strain MGAS8232; SEQ ID NO: 27.

Figure 14 represents the amino acid sequence of the SHB-GAS-102
protein from M18 S. pyogenes strain MGAS8232; SEQ ID NO: 28.

Figure 15 represents the DNA sequence of the SHB-GAS-103 gene from
serotype M3 S. pyogenes strain MGAS315; SEQ ID NO: 29. The
underlined portion of the sequence represents the region coding
for the leader peptide.

Figure 16 represents the amino acid sequence of the SHB-GAS-103
protein from M3 S. pyogenes strain MGAS315; SEQ ID NO: 30. The
underlined sequence represents the 27 amino acid residue leader
peptide.

Figure 17 represents the DNA sequence of the SHB-GAS-103 gene from
serotype M3 S. pyogenes strain SSI-1; SEQ ID NO: 31. The
underlined portion of the sequence represents the region coding
for the leader peptide.

Figure 18 represents the amino acid sequence of the SHB-GAS-103
protein from M3 S. pyogenes strain SSI-1; SEQ ID NO: 32. The
underlined sequence represents the 27 amino acid residue leader
peptide.

Figure 19 represents the DNA sequence of the SHB-GAS-103 gene from
serotype M5 S. pyogenes strain Manfredo; SEQ ID NO: 33. The
underlined portion of the sequence represents the region coding
for the leader peptide.

Figure 20 represents the amino acid sequence of the SHB-GAS-103
protein from M5 S. pyogenes strain Manfredo; SEQ ID NO: 34. The
underlined sequence represents the 27 amino acid residue leader
peptide.

Figure 21 represents the DNA sequence of the SHB-GAS-103 gene from
serotype M18 S. pyogenes strain MGAS8232; SEQ ID NO: 35. The
underlined portion of the sequence represents the region coding
for the leader peptide.

Figure 22 represents the amino acid sequence of the SHB-GAS-103
protein from M18 S. pyogenes strain MGAS8232; SEQ ID NO: 36. The
underlined sequence represents the 27 amino acid residue leader
peptide.

Figure 23 represents the DNA sequence of the SHB-GAS-104 gene from
serotype M3 S. pyogenes strain MGAS315; SEQ ID NO: 37. The
underlined portion of the sequence represents the region coding
for the leader peptide.

Figure 24 represents the amino acid sequence of the SHB-GAS-104
protein from M3 S. pyogenes strain MGAS315; SEQ ID NO: 38. The
underlined sequence represents the 19 amino acid residue leader
peptide.

Figure 25 represents the DNA sequence of the SHB-GAS-104 gene from
serotype M3 S. pyogenes strain SSI-1; SEQ ID NO: 39. The
underlined portion of the sequence represents the region coding
for the leader peptide.

Figure 26 represents the amino acid sequence of the SHB-GAS-104
protein from M3 S. pyogenes strain SSI-1; SEQ ID NO: 40. The
underlined sequence represents the 19 amino acid residue leader
peptide.

Figure 27 represents the DNA sequence of the SHB-GAS-104 gene from
serotype M5 S. pyogenes strain Manfredo; SEQ ID NO: 41. The
underlined portion of the sequence represents the region coding
for the leader peptide.

Figure 28 represents the amino acid sequence of the SHB-GAS-104
protein from M5 S. pyogenes strain Manfredo; SEQ ID NO: 42. The
underlined sequence represents the 19 amino acid residue leader
peptide.

Figure 29 represents the DNA sequence of the SHB-GAS-104 gene from
serotype M18 S. pyogenes strain MGAS8232; SEQ ID NO: 43. The
underlined portion of the sequence represents the region coding
for the leader peptide.

Figure 30 represents the amino acid sequence of the SHB-GAS-104
protein from M18 S. pyogenes strain MGAS8232; SEQ ID NO: 44. The
underlined sequence represents the 19 amino acid residue leader
peptide.

Figure 31 depicts the comparison of the nucleotide sequences of
the SHB-GAS-102 genes from the S. pyogenes serotype M1
ATCC700294 (SEQ ID NO: 1), serotype M3 MGAS315 (SEQ ID NO: 21),
serotype M3 SSI-1 (SEQ ID NO: 23), serotype M5 Manfredo (SEQ ID
NO: 25) and serotype M18 MGAS8232 (SEQ ID NO: 27) strains by using
the program Clustal W from NTI sequence analysis software.

Figure 32 depicts the comparison of the predicted amino acid
sequences of the SHB-GAS-102 open reading frames from the S.
pyogenes serotype M1 ATCC700294 (SEQ ID NO: 2), serotype M3
MGAS315 (SEQ ID NO: 22), serotype M3 SSI-1 (SEQ ID NO: 24),
serotype M5 Manfredo (SEQ ID NO: 26) and serotype M18 MGAS8232
(SEQ ID NO: 28) strains by using the program Clustal W from NTI
sequence analysis software.

Figure 33 depicts the comparison of the nucleotide sequences of
the SHB-GAS-103 genes from the S. pyogenes serotype M1
ATCC700294 (SEQ ID NO: 3), serotype M3 MGAS315 (SEQ ID NO: 29),
serotype M3 SSI-1 (SEQ ID NO: 31), serotype M5 Manfredo (SEQ ID
NO: 33) and serotype M18 MGAS8232 (SEQ ID NO: 35) strains by using
the program Clustal W from NTI sequence analysis software.

Figure 34 depicts the comparison of the predicted amino acid
sequences of the SHB-GAS-103 open reading frames from the S.
pyogenes serotype M1 ATCC700294 (SEQ ID NO: 4), serotype M3
MGAS315 (SEQ ID NO: 30), serotype M3 SSI-1 (SEQ ID NO: 32),
serotype M5 Manfredo (SEQ ID NO: 34) and serotype M18 MGAS8232
(SEQ ID NO: 36) strains by using the program Clustal W from NTI
sequence analysis software.

Figure 35 depicts the comparison of the nucleotide sequences of
the SHB-GAS-104 genes from the S. pyogenes serotype M1
ATCC700294 (SEQ ID NO: 5), serotype M3 MGAS315 (SEQ ID NO: 37),
serotype M3 SSI-1 (SEQ ID NO: 39), serotype M5 Manfredo (SEQ ID
NO: 41), and serotype M18 MGAS8232 (SEQ ID NO: 43) strains by
using the program Clustal W from NTI sequence analysis software.

Figure 36 depicts the comparison of the predicted amino acid
sequences of the SHB-GAS-104 open reading frames from the S.
pyogenes serotype M1 ATCC700294 (SEQ ID NO: 6), serotype M3
MGAS315 (SEQ ID NO: 38), serotype M3 SSI-1 (SEQ ID NO: 40),
serotype M5 Manfredo (SEQ ID NO: 42) and serotype M18 MGAS8232
(SEQ ID NO: 44) strains by using the program Clustal W from NTI
sequence analysis software.

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides purified and isolated
polynucleotides, which encode S. pyogenes polypeptides which may
be used to prevent, diagnose and/or treat S. pyogenes infection.

According to one aspect, the present invention provides an
isolated polynucleotide encoding a polypeptide having at least
70% identity to a second polypeptide comprising SEQ ID NO: 2,
4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44 or
fragments or analogs thereof.

According to one aspect, the present invention provides an
isolated polynucleotide encoding a polypeptide having at least
80% identity to a second polypeptide comprising SEQ ID NO: 2,
4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44 or
fragments or analogs thereof.

According to one aspect, the present invention provides an
isolated polynucleotide encoding a polypeptide having at least
90% identity to a second polypeptide comprising SEQ ID NO: 2,
4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44 or
fragments or analogs thereof.

According to one aspect, the present invention provides an
isolated polynucleotide encoding a polypeptide having at least
95% identity to a second polypeptide comprising SEQ ID NO: 2,
4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44.

According to one aspect, the present invention provides an
isolated polynucleotide encoding a polypeptide having at least
98% identity to a second polypeptide comprising SEQ ID NO: 2,
4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44.

According to one aspect, the present invention provides an
isolated polynucleotide encoding a polypeptide having at least
99% identity to a second polypeptide comprising SEQ ID NO: 2,
4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44. 
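
To illustrate how an "at least N% identity" threshold such as those recited above might be evaluated, the following Python sketch computes percent identity over two sequences that have already been aligned. It is only a sketch under stated assumptions: the function names, the "-" gap character, the choice to count identical residues over all aligned columns that are not gaps in both sequences, and the toy sequences in the usage note are illustrative and are not taken from the specification, which does not prescribe a particular calculation in this passage.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumed convention: columns that are gaps in both sequences are ignored,
    and identical residues are counted over the remaining columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    columns = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue  # column carries no residue in either sequence
        columns += 1
        if a == b:
            matches += 1  # both are the same residue (never gap vs gap here)
    if columns == 0:
        raise ValueError("no aligned columns to compare")
    return 100.0 * matches / columns


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float) -> bool:
    """True if the aligned pair reaches the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For example, with the made-up aligned pair "MKTA-YIAK" and "MKTAGYIAR", seven of nine columns are identical, so the pair would satisfy a 70% threshold but not an 80% threshold. Other conventions for the denominator (for instance, the length of the shorter sequence) give different values, so the convention should be stated whenever a figure is reported.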

According to one aspect, the present invention relates to
polypeptides comprising SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30,
32, 34, 36, 38, 40, 42, 44 or fragments or analogs thereof.

According to one aspect, the present invention relates to
polypeptides comprising SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30,
32, 34, 36, 38, 40, 42, 44.

According to one aspect, the present invention provides a
polynucleotide encoding an epitope bearing portion of a
polypeptide comprising SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30,
32, 34, 36, 38, 40, 42, 44 or fragments or analogs thereof.

According to one aspect, the present invention provides a
polynucleotide encoding an epitope bearing portion of a
polypeptide comprising SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30,
32, 34, 36, 38, 40, 42, 44.

According to one aspect, the present invention relates to
epitope bearing portions of a polypeptide comprising SEQ ID NO:
2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44 or
fragments or analogs thereof.

According to one aspect, the present invention relates to
epitope bearing portions of a polypeptide comprising SEQ ID NO:
2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44.

According to one aspect, the present invention provides an
isolated polynucleotide comprising a polynucleotide chosen from:

(a) a polynucleotide encoding a polypeptide having at least 95%
identity to SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32,
34, 36, 38, 40, 42, 44 or fragments or analogs thereof;

(b) a polynucleotide encoding a polypeptide having at least 98%
identity to SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32,
34, 36, 38, 40, 42, 44 or fragments or analogs thereof;

(c) a polynucleotide encoding a polypeptide having at least 99%
identity to SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32,
34, 36, 38, 40, 42, 44 or fragments or analogs thereof;

(d) a polynucleotide encoding a polypeptide comprising SEQ ID
NO: 2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42,
44 or fragments or analogs thereof;

(e) a polynucleotide encoding a polypeptide capable of raising
antibodies having binding specificity for a polypeptide
comprising SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32, 34,
36, 38, 40, 42, 44 or fragments or analogs thereof;

(f) a polynucleotide encoding an epitope bearing portion of a
polypeptide comprising SEQ ID NO: 2, 4, 6, 22, 24, 26, 28,
30, 32, 34, 36, 38, 40, 42, 44 or fragments or analogs
thereof;

(g) a polynucleotide comprising SEQ ID NO: 1, 3, 5, 21, 23,
25, 27, 29, 31, 33, 35, 37, 39, 41, 43 or fragments or
analogs thereof;

(h) a polynucleotide that is complementary to a polynucleotide
in (a), (b), (c), (d), (e), (f) or (g).

According to one aspect, the present invention provides an
isolated polynucleotide comprising a polynucleotide chosen from:

(a) a polynucleotide encoding a polypeptide having at least 95%
identity to SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32,
34, 36, 38, 40, 42 or 44;

(b) a polynucleotide encoding a polypeptide having at least 98%
identity to SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32,
34, 36, 38, 40, 42 or 44;

(c) a polynucleotide encoding a polypeptide having at least 99%
identity to SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32,
34, 36, 38, 40, 42 or 44;

(d) a polynucleotide encoding a polypeptide comprising SEQ ID
NO: 2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42 or
44;

(e) a polynucleotide encoding a polypeptide capable of raising
antibodies having binding specificity for a polypeptide
comprising SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32, 34,
36, 38, 40, 42 or 44;

(f) a polynucleotide encoding an epitope bearing portion of a
polypeptide comprising SEQ ID NO: 2, 4, 6, 22, 24, 26, 28,
30, 32, 34, 36, 38, 40, 42 or 44;

(g) a polynucleotide comprising SEQ ID NO: 1, 3, 5, 21, 23,
25, 27, 29, 31, 33, 35, 37, 39, 41, 43;

(h) a polynucleotide that is complementary to a polynucleotide
in (a), (b), (c), (d), (e), (f) or (g).

According to one aspect, the present invention provides an
isolated polynucleotide consisting essentially of a
polynucleotide chosen from:

a) a polynucleotide encoding a polypeptide having at least
95% identity to SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30,
32, 34, 36, 38, 40, 42, 44 or fragments or analogs
thereof;

b) a polynucleotide encoding a polypeptide having at least
98% identity to SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30,
32, 34, 36, 38, 40, 42, 44 or fragments or analogs
thereof;

c) a polynucleotide encoding a polypeptide having at least
99% identity to SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30,
32, 34, 36, 38, 40, 42, 44 or fragments or analogs
thereof;

d) a polynucleotide encoding a polypeptide having SEQ ID
NO: 2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40,
42, 44 or fragments or analogs thereof;

e) a polynucleotide encoding a polypeptide capable of
raising antibodies having binding specificity for a
polypeptide having SEQ ID NO: 2, 4, 6, 22, 24, 26, 28,
30, 32, 34, 36, 38, 40, 42, 44 or fragments or analogs
thereof;

f) a polynucleotide encoding an epitope bearing portion of
a polypeptide having SEQ ID NO: 2, 4, 6, 22, 24, 26,
28, 30, 32, 34, 36, 38, 40, 42, 44 or fragments or
analogs thereof;

g) a polynucleotide having SEQ ID NO: 1, 3, 5, 21, 23, 25,
27, 29, 31, 33, 35, 37, 39, 41, 43 or fragments or
analogs thereof;

h) a polynucleotide that is complementary to a
polynucleotide in (a), (b), (c), (d), (e), (f) or (g)

wherein said polynucleotide encodes a polypeptide that is
immunogenic.



According to one aspect, the present invention provides an
isolated polynucleotide consisting essentially of a
polynucleotide chosen from:

a) a polynucleotide encoding a polypeptide having at least
95% identity to SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30,
32, 34, 36, 38, 40, 42 or 44;

b) a polynucleotide encoding a polypeptide having at least
98% identity to SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30,
32, 34, 36, 38, 40, 42 or 44;

c) a polynucleotide encoding a polypeptide having at least
99% identity to SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30,
32, 34, 36, 38, 40, 42 or 44;

d) a polynucleotide encoding a polypeptide having SEQ ID
NO: 2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42
or 44;

e) a polynucleotide encoding a polypeptide capable of
raising antibodies having binding specificity for a
polypeptide having SEQ ID NO: 2, 4, 6, 22, 24, 26, 28,
30, 32, 34, 36, 38, 40, 42 or 44;

f) a polynucleotide encoding an epitope bearing portion of
a polypeptide having SEQ ID NO: 2, 4, 6, 22, 24, 26,
28, 30, 32, 34, 36, 38, 40, 42 or 44;

g) a polynucleotide having SEQ ID NO: 1, 3, 5, 21, 23, 25,
27, 29, 31, 33, 35, 37, 39, 41 or 43;

h) a polynucleotide that is complementary to a
polynucleotide in (a), (b), (c), (d), (e), (f) or (g)

wherein said polynucleotide encodes a polypeptide that is
immunogenic.

According to one aspect, the present invention provides an
isolated polypeptide comprising a polypeptide chosen from:

(a) a polypeptide having at least 95% identity to SEQ ID NO:
2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44 or
fragments or analogs thereof;

(b) a polypeptide having at least 98% identity to SEQ ID NO:
2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44 or
fragments or analogs thereof;

WO 2005/017093 PCT/CA2004/001510

(c) a polypeptide having at least 99% identity to SEQ ID NO:
2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44 or
fragments or analogs thereof;

(d) a polypeptide comprising SEQ ID NO: 2, 4, 6, 22, 24, 26,
28, 30, 32, 34, 36, 38, 40, 42, 44 or fragments or analogs
thereof;

(e) a polypeptide capable of raising antibodies having binding
specificity for a polypeptide comprising SEQ ID NO: 2, 4,
6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44 or
fragments or analogs thereof;

(f) an epitope bearing portion of a polypeptide comprising SEQ
ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40,
42, 44 or fragments or analogs thereof;

(g) the polypeptide of (a), (b), (c), (d), (e) or (f) wherein
the N-terminal Met residue is deleted;

(h) the polypeptide of (a), (b), (c), (d), (e) or (f) wherein
the secretory amino acid sequence is deleted.

According to one aspect, the present invention provides an
isolated polypeptide comprising a polypeptide chosen from:

(a) a polypeptide having at least 95% identity to SEQ ID NO:
2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42 or 44;

(b) a polypeptide having at least 98% identity to SEQ ID NO:
2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42 or 44;

(c) a polypeptide having at least 99% identity to SEQ ID NO:
2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42 or 44;

(d) a polypeptide comprising SEQ ID NO: 2, 4, 6, 22, 24, 26,
28, 30, 32, 34, 36, 38, 40, 42 or 44;

(e) a polypeptide capable of raising antibodies having binding
specificity for a polypeptide comprising SEQ ID NO: 2, 4,
6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42 or 44;

(f) an epitope bearing portion of a polypeptide comprising SEQ
ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42
or 44;

(g) the polypeptide of (a), (b), (c), (d), (e) or (f) wherein
the N-terminal Met residue is deleted;

(h) the polypeptide of (a), (b), (c), (d), (e) or (f) wherein
the secretory amino acid sequence is deleted.

According to one aspect, the present invention provides an
isolated polypeptide consisting essentially of a polypeptide
chosen from:

a) a polypeptide having at least 95% identity to an amino
acid sequence having SEQ ID NO: 2, 4, 6, 22, 24, 26, 28,
30, 32, 34, 36, 38, 40, 42, 44 or fragments or analogs
thereof;

b) a polypeptide having at least 98% identity to an amino
acid sequence having SEQ ID NO: 2, 4, 6, 22, 24, 26, 28,
30, 32, 34, 36, 38, 40, 42, 44 or fragments or analogs
thereof;

c) a polypeptide having at least 99% identity to an amino
acid sequence having SEQ ID NO: 2, 4, 6, 22, 24, 26, 28,
30, 32, 34, 36, 38, 40, 42, 44 or fragments or analogs
thereof;

d) a polypeptide having SEQ ID NO: 2, 4, 6, 22, 24, 26, 28,
30, 32, 34, 36, 38, 40, 42, 44 or fragments or analogs
thereof;

e) a polypeptide capable of raising antibodies having
binding specificity for a polypeptide having SEQ ID NO:
2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44
or fragments or analogs thereof;

f) an epitope bearing portion of a polypeptide having SEQ ID
NO: 2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42,
44 or fragments or analogs thereof;

g) the polypeptide of (a), (b), (c), (d), (e) or (f) wherein
the N-terminal Met residue is deleted;

h) the polypeptide of (a), (b), (c), (d), (e), or (f)
wherein the secretory amino acid sequence is deleted

wherein said polypeptide is immunogenic.

According to one aspect, the present invention provides an
isolated polypeptide comprising a polypeptide chosen from:

a) a polypeptide having at least 95% identity to an amino acid
sequence having SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32,
34, 36, 38, 40, 42 or 44;

b) a polypeptide having at least 98% identity to an amino acid
sequence having SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32,
34, 36, 38, 40, 42 or 44;

c) a polypeptide having at least 99% identity to an amino acid
sequence having SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32,
34, 36, 38, 40, 42 or 44;

d) a polypeptide comprising SEQ ID NO: 2, 4, 6, 22, 24, 26,
28, 30, 32, 34, 36, 38, 40, 42 or 44;

e) a polypeptide capable of raising antibodies having binding
specificity for a polypeptide having SEQ ID NO: 2, 4, 6,
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42 or 44;

f) an epitope bearing portion of a polypeptide having SEQ ID
NO: 2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42 or
44;

g) the polypeptide of (a), (b), (c), (d), (e) or (f) wherein
the N-terminal Met residue is deleted;

h) the polypeptide of (a), (b), (c), (d), (e), or (f) wherein
the secretory amino acid sequence is deleted.
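
Items (g) and (h) of the lists above refer to polypeptides in which the N-terminal Met residue or the secretory (leader) sequence is deleted. The following Python sketch shows one way such derived sequences might be obtained from a one-letter amino acid string. It is an illustration only: the function names and the placeholder sequence in the usage note are hypothetical, and the leader lengths must be supplied by the caller (the figure descriptions give 27 residues for SHB-GAS-103 and 19 residues for SHB-GAS-104).

```python
def remove_n_terminal_met(sequence: str) -> str:
    """Return the sequence without its N-terminal Met, if one is present."""
    seq = sequence.strip().upper()
    return seq[1:] if seq.startswith("M") else seq


def remove_leader(sequence: str, leader_length: int) -> str:
    """Return the sequence with the first `leader_length` residues removed.

    `leader_length` is supplied by the caller; this sketch does not predict
    where a leader peptide ends.
    """
    seq = sequence.strip().upper()
    if leader_length < 0 or leader_length >= len(seq):
        raise ValueError("leader_length must be between 0 and len(sequence) - 1")
    return seq[leader_length:]
```

As a usage example with a made-up 34-residue placeholder, "M" followed by 26 "A" residues and then "DEFGHIK", calling remove_leader with a leader length of 27 returns "DEFGHIK", and remove_n_terminal_met returns the same placeholder shortened by its first residue.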



Those skilled in the art will appreciate that the invention
includes DNA molecules, i.e. polynucleotides and their
complementary sequences that encode analogs such as mutants,
variants, homologues and derivatives of such polypeptides, as
described herein in the present patent application. The
invention also includes RNA molecules corresponding to the DNA
molecules of the invention. In addition to the DNA and RNA
molecules, the invention includes the corresponding polypeptides
and monospecific antibodies that specifically bind to such
polypeptides.

In accordance with the present invention, all polynucleotides
encoding polypeptides of the present invention are within the
scope of the present invention.

In a further embodiment, the polypeptides in accordance with the
present invention are antigenic, i.e. are able to bind
specifically to components of the immune response, such as
antibodies and lymphocytes.

In a further embodiment, the polypeptides in accordance with the
present invention are immunogenic or can elicit an immune
response in a host.

In a further embodiment, the present invention also relates to
polypeptides which are able to raise antibodies having binding
specificity to the polypeptides of the present invention as
defined above.

An antibody that "has binding specificity" is an antibody that
recognizes and binds the selected polypeptide but which does not
substantially recognize and bind other molecules in a sample,
e.g., a biological sample, which naturally includes the selected
peptide. Specific binding can be measured using an ELISA assay
in which the selected polypeptide is used as an antigen.

In a further embodiment, the polypeptides in accordance with the
present invention can elicit a B cell response in a host.

In a further embodiment, the polypeptides in accordance with the
present invention can elicit a T cell response in a host.

In accordance with the present invention, the polypeptides of
the invention can also be effective when administered to a host
to protect against the bacteria. "Protection" in the biological
studies is defined by a significant increase in the survival
curve, rate or period. Statistical analysis using the log-rank
test to compare survival curves, and the Fisher exact test to
compare survival rates and numbers of days to death,
respectively, can be used to calculate P values and determine
whether the difference between the two groups is statistically
significant. P values greater than 0.05 are regarded as not
significant.
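
As an illustration of the survival-rate comparison described above, the following Python sketch computes a two-sided Fisher exact P value for a 2x2 table of survivors and deaths in two groups, using only the standard library. It is a sketch under stated assumptions: the function name, the table layout and the animal counts in the usage note are hypothetical and are not data from the specification, and the log-rank comparison of survival curves is not covered here.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher exact P value for the 2x2 table [[a, b], [c, d]].

    Assumed layout: rows are the two groups, columns are survived / died.
    The P value sums the hypergeometric probabilities of all tables with the
    same margins that are no more likely than the observed table.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total_tables = comb(row1 + row2, col1)

    def prob(x: int) -> float:
        # Probability that x of the col1 survivors fall in the first group.
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    p_value = sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)  # tolerance for float ties
    )
    return min(1.0, p_value)
```

With hypothetical counts of 8 survivors out of 10 in an immunized group and 1 survivor out of 10 in a control group, fisher_exact_two_sided(8, 2, 1, 9) returns 920/167960, about 0.0055, which falls below the 0.05 cut-off stated above.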

In an additional aspect of the invention there are provided
antigenic/immunogenic fragments of the polypeptides of the
invention, or of analogs thereof.

The present invention also relates to fragments which are
specific to, or for, SEQ ID NOs: 2, 4, 6, 22, 24, 26, 28, 30, 32,
34, 36, 38, 40, 42 or 44. A specific fragment contains a
defined order of amino acids which occurs in a target
polypeptide, and which is characteristic of that target
polypeptide but of substantially no non-target polypeptides.
Such polypeptide fragments can be of any size which is necessary
to confer specificity, e.g., comprising or consisting of at
least 6, 8, 10, 12, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65,
70, 75, 80, 85, 90, 95, 100, 105, 110, 115 amino acids, etc.

Specific fragments can also be described as being specific for
Streptococcus pyogenes, indicating that they occur in that
bacterium, but not in other organisms, especially not in GBS or
other bacteria.
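
The notion of a specific fragment of a given minimum length can be sketched in Python as follows. This is an illustration under stated assumptions: the function names and the toy sequences in the usage note are hypothetical, and exact substring matching is used as a simple stand-in for the requirement that a fragment be characteristic of the target and of substantially no non-target polypeptide.

```python
from typing import Iterable, List


def contiguous_fragments(sequence: str, length: int) -> List[str]:
    """All contiguous fragments of exactly `length` residues, in order."""
    if length < 1:
        raise ValueError("length must be at least 1")
    seq = sequence.upper()
    return [seq[i:i + length] for i in range(len(seq) - length + 1)]


def specific_fragments(target: str, non_targets: Iterable[str], length: int) -> List[str]:
    """Fragments of `target` that occur in none of the non-target sequences."""
    others = [s.upper() for s in non_targets]
    seen = set()
    result = []
    for fragment in contiguous_fragments(target, length):
        if fragment in seen:
            continue  # report each distinct fragment once
        seen.add(fragment)
        if not any(fragment in other for other in others):
            result.append(fragment)
    return result
```

For the made-up target "MKTAYIAKQR" and the made-up non-target "GGKTAYIAKGG", the five 6-residue fragments of the target reduce to three specific ones, "MKTAYI", "AYIAKQ" and "YIAKQR", because "KTAYIA" and "TAYIAK" also occur in the non-target sequence.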



The fragments of the present invention should include one or
more such epitopic regions or be sufficiently similar to such
regions to retain substantially their antigenic/immunogenic
properties. Thus, for fragments according to the present
invention the degree of identity is perhaps irrelevant, since
they may be 100% identical to a particular part of a polypeptide
or analog thereof as described herein.

The present invention further provides fragments having a
smaller sequence than the ones described in the figures.

The present invention further provides fragments having at least
In one embodiment, at least 120 contiguous amino acid residues.
In one embodiment, at least 125 contiguous amino acid residues.
In one embodiment, at least 130 contiguous amino acid residues.
In one embodiment, at least 135 contiguous amino acid residues.
In one embodiment, at least 140 contiguous amino acid residues.
In one embodiment, at least 145 contiguous amino acid residues.
In one embodiment, at least 150 contiguous amino acid residues.
In one embodiment, at least 155 contiguous amino acid residues.
In one embodiment, at least 160 contiguous amino acid residues.
In one embodiment, at least 165 contiguous amino acid residues.
In one embodiment, at least 170 contiguous amino acid residues.
In one embodiment, at least 175 contiguous amino acid residues.
In one embodiment, at least 180 contiguous amino acid residues.
In one embodiment, at least 185 contiguous amino acid residues.
In one embodiment, at least 190 contiguous amino acid residues.
In one embodiment, at least 195 contiguous amino acid residues.
In one embodiment, at least 200 contiguous amino acid residues.
In one embodiment, at least 205 contiguous amino acid residues.
In one embodiment, at least 210 contiguous amino acid residues.
In one embodiment, at least 215 contiguous amino acid residues.
In one embodiment, at least 220 contiguous amino acid residues.
In one embodiment, at least 225 contiguous amino acid residues.
In one embodiment, at least 230 contiguous amino acid residues.
In one embodiment, at least 235 contiguous amino acid residues.
In one embodiment, at least 240 contiguous amino acid residues.
In one embodiment, at least 245 contiguous amino acid residues.
In one embodiment, at least 250 contiguous amino acid residues.
In one embodiment, at least 255 contiguous amino acid residues.
In one embodiment, at least 260 contiguous amino acid residues.
In one embodiment, at least 265 contiguous amino acid residues.
In one embodiment, at least 270 contiguous amino acid residues.
In one embodiment, at least 275 contiguous amino acid residues.
In one embodiment, at least 280 contiguous amino acid residues.
In one embodiment, at least 285 contiguous amino acid residues.
In one embodiment, at least 290 contiguous amino acid residues.
In one embodiment, at least 295 contiguous amino acid residues.
In one embodiment, at least 300 contiguous amino acid residues.
In one embodiment, at least 305 contiguous amino acid residues.

Polypeptide fragments of the invention may be of any size that is
compatible with the invention. They may range in size from the
smallest specific epitope (e.g., about 6 amino acids) to a nearly
full-length gene product (e.g., a single amino acid shorter than
SEQ ID NOs: 2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42
or 44).

The skilled person will appreciate that analogs of the
polypeptides of the invention will also find use in the context
of the present invention, i.e. as antigenic/immunogenic
material. Thus, for instance, proteins or polypeptides which
include one or more additions, deletions, substitutions or the
like are encompassed by the present invention.

As used herein, "fragments", "analogs" or "derivatives" of the 
polypeptides of the invention include those polypeptides in
which one or more of the amino acid residues are substituted 
with a conserved or non-conserved amino acid residue (preferably 
conserved) and which may be natural or unnatural. 

These substitutions are those having a minimal influence on the
secondary structure and hydropathic nature of the polypeptide.
Preferred substitutions are those known in the art as conserved,
i.e. the substituted residues share physical or chemical
properties such as hydrophobicity, size, charge or functional
groups. These include substitutions such as those described by
Dayhoff, M. in Atlas of Protein Sequence and Structure 5, 1978
and by Argos, P. in EMBO J. 8, 779-785, 1989. For example, amino
acids, either natural or unnatural, belonging to one of the
following groups represent conservative changes:

ala, pro, gly, gln, asn, ser, thr, val;
cys, ser, tyr, thr;
val, ile, leu, met, ala, phe;
lys, arg, orn, his;
and phe, tyr, trp, his.

The preferred substitutions also include substitutions of
D-enantiomers for the corresponding L-amino acids.
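
The grouping above lends itself to a simple lookup. The following Python sketch, offered as an illustration only, encodes the five groups and tests whether a proposed substitution keeps both residues within at least one common group. The names `CONSERVATIVE_GROUPS` and `is_conservative` are hypothetical, and the three-letter codes are written in lower case to match the list.

```python
# The five groups of residues listed above (three-letter codes, lower case).
CONSERVATIVE_GROUPS = [
    {"ala", "pro", "gly", "gln", "asn", "ser", "thr", "val"},
    {"cys", "ser", "tyr", "thr"},
    {"val", "ile", "leu", "met", "ala", "phe"},
    {"lys", "arg", "orn", "his"},
    {"phe", "tyr", "trp", "his"},
]


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if both residues belong to at least one common group.

    Residues that appear in none of the groups (e.g. asp, glu) always
    give False, because the list does not classify them.
    """
    a, b = original.lower(), replacement.lower()
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


print(is_conservative("val", "ile"))
print(is_conservative("lys", "asp"))
```

Because some residues (for example ser, thr, ala, val, phe, tyr and his) sit in more than one group, the check asks whether any single group contains both residues rather than assigning each residue to exactly one class.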

In one embodiment, preferred base substitutions are located
where there is less sequence homology on Figures 31, 33 or 35.

In one embodiment, preferred amino acid substitutions are
located where there is less sequence homology on Figures 32, 34
or 36.

The additional amino acid residues may be from a heterologous 
source or may be endogenous to the natural gene. 

In an alternative approach, the analogs could be fusion
polypeptides, incorporating moieties which render purification
easier, for example by effectively tagging the desired
polypeptide. It may be necessary to remove the "tag" or it may
be the case that the fusion polypeptide itself retains
sufficient antigenicity to be useful.

The percentage of homology is defined as the sum of the 
percentage of identity plus the percentage of similarity or 
conservation of amino acid type. 

In one embodiment, analogs of polypeptides of the invention will
have about 70% identity with those sequences illustrated in the
figures or fragments thereof. That is, 70% of the residues are
the same. In a further embodiment, polypeptides will have
greater than 80% identity. In a further embodiment, polypeptides
will have greater than 85% identity. In a further embodiment,
polypeptides will have greater than 90% identity. In a further
embodiment, polypeptides will have greater than 95% identity.
In a further embodiment, polypeptides will have greater than 98%
identity. In a further embodiment, polypeptides will have
greater than 99% identity. In a further embodiment, analogs of
polypeptides of the invention will have fewer than about 20
amino acid residue substitutions, modifications or deletions and
more preferably fewer than 10.

In one embodiment, analogs of polypeptides of the invention will
have about 70% similarity with those sequences illustrated in
the figures or fragments thereof. That is, 70% of the residues
are the same. In a further embodiment, polypeptides will have
greater than 80% similarity. In a further embodiment,
polypeptides will have greater than 85% similarity. In a further
embodiment, polypeptides will have greater than 90% similarity.
In a further embodiment, polypeptides will have greater than 95%
similarity. In a further embodiment, polypeptides will have
greater than 98% similarity. In a further embodiment,
polypeptides will have greater than 99% similarity. In a
further embodiment, analogs of polypeptides of the invention
will have fewer than about 20 amino acid residue substitutions,
modifications or deletions and more preferably fewer than 10.

In one embodiment, analogs of polypeptides of the invention will
have about 70% homology with those sequences illustrated in the
figures or fragments thereof. In a further embodiment,
polypeptides will have greater than 80% homology. In a further
embodiment, polypeptides will have greater than 85% homology.
In a further embodiment, polypeptides will have greater than 90%
homology. In a further embodiment, polypeptides will have
greater than 95% homology. In a further embodiment, polypeptides
will have greater than 98% homology. In a further embodiment,
polypeptides will have greater than 99% homology. In a further
embodiment, analogs of polypeptides of the invention will have
fewer than about 20 amino acid residue substitutions,
modifications or deletions and more preferably fewer than 10.

One can use a program such as the CLUSTAL program to compare
amino acid sequences. This program compares amino acid
sequences and finds the optimal alignment by inserting spaces in
either sequence as appropriate. It is possible to calculate
amino acid identity or homology for an optimal alignment. A
program like BLASTx will align the longest stretch of similar
sequences and assign a value to the fit. It is thus possible to
obtain a comparison where several regions of similarity are
found, each having a different score. Both types of identity
analysis are contemplated in the present invention.
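
Once two sequences have been aligned, the percentages discussed above reduce to counting columns. The Python sketch below is an illustration under stated assumptions, not the method used by CLUSTAL or BLASTx: it takes two already-aligned sequences (gaps written as "-"), counts identical and conservatively substituted positions, divides by the full alignment length (one possible choice of denominator), and reports homology as the sum of identity and similarity. The one-letter groups are an assumed transcription of the conservative groups listed earlier, with ornithine omitted because it has no standard one-letter code; all names are hypothetical.

```python
# Conservative groups in one-letter code (ornithine omitted: no one-letter code).
GROUPS = ["APGQNSTV", "CSYT", "VILMAF", "KRH", "FYWH"]


def alignment_scores(aligned_a: str, aligned_b: str):
    """Return (identity %, similarity %, homology %) for two aligned sequences.

    Assumptions: both strings have the same length, gaps are '-', and the
    denominator is the total alignment length including gapped columns.
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    identical = similar = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue  # a gapped column is neither identical nor similar
        if a == b:
            identical += 1
        elif any(a in g and b in g for g in GROUPS):
            similar += 1
    n = len(aligned_a)
    identity = 100.0 * identical / n
    similarity = 100.0 * similar / n
    return identity, similarity, identity + similarity


# Hypothetical toy alignment, for illustration only.
identity, similarity, homology = alignment_scores("MKVLA-T", "MRVIAGT")
print(round(identity, 1), round(similarity, 1), round(homology, 1))
```

Other conventions (for example, dividing by the length of the shorter sequence or excluding gapped columns) give different percentages, so the denominator should be stated whenever a threshold such as 70% is applied.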

In an alternative approach, the analogs or derivatives could be
fusion polypeptides, incorporating moieties which render
purification easier, for example by effectively tagging the
desired protein or polypeptide. It may be necessary to remove
the "tag" or it may be the case that the fusion polypeptide
itself retains sufficient antigenicity to be useful.

It is well known that it is possible to screen an antigenic
polypeptide to identify epitopic regions, i.e. those regions
which are responsible for the polypeptide's antigenicity or
immunogenicity. Methods for carrying out such screening are
well known in the art. Thus, the fragments of the present
invention should include one or more such epitopic regions or be
sufficiently similar to such regions to retain their
antigenic/immunogenic properties.

The invention also encompasses polypeptides having a lower
degree of sequence identity, but having sufficient similarity so
as to perform one or more of the functions or activities
exhibited by the native polypeptides.

Thus, what is important for analogs, derivatives and fragments
is that they possess at least a degree of the antigenicity/
immunogenicity of the protein or polypeptide from which they are
derived.

Also included are polypeptides in which one or more of the amino
acid residues includes a substituent group. These polypeptides
include, e.g., modified polypeptides. Known polypeptide
modifications include, but are not limited to, glycosylation,
acetylation, acylation, ADP-ribosylation, amidation, covalent
attachment of flavin, covalent attachment of a heme moiety,
covalent attachment of a nucleotide or nucleotide derivative,
covalent attachment of a lipid or lipid derivative, covalent
attachment of phosphatidylinositol, cross-linking, cyclization,
disulfide bond formation, demethylation, formation of covalent
crosslinks, formation of cystine, formation of pyroglutamate,
formylation, gamma-carboxylation, glycosylation, GPI anchor
formation, hydroxylation, iodination, methylation, myristoylation,
oxidation, proteolytic processing, phosphorylation, prenylation,
racemization, selenoylation, sulfation, transfer-RNA mediated
addition of amino acids to proteins such as arginylation, and
ubiquitination.

Such modifications are well-known to those of skill in the art
and have been described in great detail in the scientific
literature. Several particularly common modifications,
glycosylation, lipid attachment, sulfation, gamma-carboxylation
of glutamic acid residues, hydroxylation and ADP-ribosylation,
for instance, are described in many basic texts, such as
Proteins - Structure and Molecular Properties, 2nd ed., T.E.
Creighton, W.H. Freeman and Company, New York (1993). Many
detailed reviews are available on this subject, such as by Wold,
F., Posttranslational Covalent Modification of Proteins, B.C.
Johnson, Ed., Academic Press, New York 1-12 (1983); Seifter et
al. (1990) Meth. Enzymol. 182:626-646 and Rattan et al. (1992)
Ann. N.Y. Acad. Sci. 663:48-62.

Also included are polypeptides which have fused thereto other
compounds which alter the polypeptides' biological or
pharmacological properties, e.g. polyethylene glycol (PEG) to
increase half-life; leader or secretory amino acid sequences for
ease of purification; prepro- and pro- sequences; and
(poly)saccharides.



Furthermore, in those situations where amino acid regions are 
found to be polymorphic, it may be desirable to vary one or more
particular amino acids to more effectively mimic the different 
epitopes of the different S. pyogenes strains. 

Moreover, the polypeptides of the present invention can be
modified by terminal -NH2 acylation (e.g. by acetylation, or
thioglycolic acid amidation, terminal carboxy amidation, e.g.
with ammonia or methylamine) to provide stability, increased
hydrophobicity for linking or binding to a support or other
molecule.

Also contemplated are hetero and homo polypeptide multimers of
the polypeptide fragments and analogues. These polymeric forms
include, for example, one or more polypeptides that have been
cross-linked with cross-linkers such as avidin/biotin,
glutaraldehyde or dimethylsuberimidate. Such polymeric forms
also include polypeptides containing two or more tandem or
inverted contiguous sequences, produced from multicistronic
mRNAs generated by recombinant DNA technology.

In a further embodiment, the present invention also relates to
chimeric polypeptides which comprise one or more polypeptides or
fragments or analogs thereof as defined in the figures of the
present application.

In a further embodiment, the present invention also relates to
chimeric polypeptides comprising two or more polypeptides
comprising SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36,
38, 40, 42, 44 or fragments or analogs thereof; provided that
the polypeptides are linked as to form a chimeric polypeptide.

In a further embodiment, the present invention also relates to
chimeric polypeptides comprising two or more polypeptides
comprising SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36,
38, 40, 42, 44 provided that the polypeptides are linked as to
form a chimeric polypeptide.

Preferably, a fragment, analog or derivative of a polypeptide of
the invention will comprise at least one antigenic region, i.e.
at least one epitope.

In order to achieve the formation of antigenic polymers (i.e.
synthetic multimers), polypeptides may be utilized having
bishaloacetyl groups, nitroarylhalides, or the like, where the
reagents are specific for thio groups. Therefore, the link
between two mercapto groups of the different polypeptides may be
a single bond or may be composed of a linking group of at least
two, typically at least four, and not more than 16, but usually
not more than about 14 carbon atoms.

In a particular embodiment, polypeptide fragments and analogs of
the invention do not contain a starting residue, such as
methionine (Met) or valine (Val). Preferably, polypeptides will
not incorporate a leader or secretory sequence (signal
sequence). The signal portion of a polypeptide of the invention
may be determined according to established molecular biological
techniques. In general, the polypeptide of interest may be
isolated from a S. pyogenes culture and subsequently sequenced
to determine the initial residue of the mature protein and
therefore the sequence of the mature polypeptide.

It is understood that polypeptides can be produced and/or used
without their start codon (methionine or valine) and/or without
their leader peptide to favor production and purification of
recombinant polypeptides. It is known that cloning genes without
sequences encoding leader peptides will restrict the
polypeptides to the cytoplasm of E. coli and will facilitate
their recovery (Glick, B.R. and Pasternak, J.J. (1998)
Manipulation of gene expression in prokaryotes. In "Molecular
biotechnology: Principles and applications of recombinant DNA",
2nd edition, ASM Press, Washington DC, p. 109-143).

In another embodiment, the polypeptides of the invention may be 
lacking an N-terminal leader peptide, and/or a transmembrane
domain and/or a C-terminal anchor domain. 

The present invention further provides a fragment of the
polypeptide comprising substantially all of the extracellular
domain of a polypeptide which has at least 70% identity,
preferably 80% identity, more preferably 95% identity, to a
sequence chosen from SEQ ID NOs: 2, 4, 6, 22, 24, 26, 28, 30,
32, 34, 36, 38, 40, 42, 44 or fragments or analogs thereof, over
the entire length of said sequence.

According to another aspect of the invention, there are also
provided (i) a composition of matter containing a polypeptide of
the invention, together with a carrier, diluent or adjuvant;
(ii) a pharmaceutical composition comprising a polypeptide of
the invention and a carrier, diluent or adjuvant; (iii) a
vaccine comprising a polypeptide of the invention and a carrier,
diluent or adjuvant; (iv) a method for inducing an immune
response against S. pyogenes, in a host, by administering to the
host an immunogenically effective amount of a polypeptide of
the invention to elicit an immune response, e.g., a protective
immune response to S. pyogenes; and particularly, (v) a method
for preventing and/or treating a S. pyogenes infection, by
administering a prophylactic or therapeutic amount of a
polypeptide of the invention to a host in need.

According to another aspect of the invention, there are also
provided (i) a composition of matter containing a polynucleotide
of the invention, together with a carrier, diluent or adjuvant;
(ii) a pharmaceutical composition comprising a polynucleotide of
the invention and a carrier, diluent or adjuvant; (iii) a method
for inducing an immune response against S. pyogenes, in a host,
by administering to the host an immunogenically effective
amount of a polynucleotide of the invention to elicit an immune
response, e.g., a protective immune response to S. pyogenes; and
particularly, (iv) a method for preventing and/or treating a S.
pyogenes infection, by administering a prophylactic or
therapeutic amount of a polynucleotide of the invention to a
host in need.

Before immunization, the polypeptides of the invention can also
be coupled or conjugated to carrier proteins such as tetanus
toxin, diphtheria toxin, hepatitis B virus surface antigen,
poliomyelitis virus VP1 antigen or any other viral or bacterial
toxin or antigen or any suitable proteins to stimulate the
development of a stronger immune response. This coupling or
conjugation can be done chemically or genetically. A more
detailed description of peptide-carrier conjugation is available
in Van Regenmortel, M.H.V., Briand J.P., Muller S., Plaue S.,
«Synthetic Polypeptides as antigens» in Laboratory Techniques in
Biochemistry and Molecular Biology, Vol. 19 (ed.) Burdon, R.H. &
Van Knippenberg P.H. (1988), Elsevier New York.

According to another aspect, there are provided pharmaceutical
compositions comprising one or more S. pyogenes polypeptides or
chimeric polypeptides of the invention in a mixture with a
pharmaceutically acceptable carrier or diluent.

According to another aspect, there are provided pharmaceutical
compositions comprising one or more S. pyogenes polypeptides or
chimeric polypeptides of the invention in a mixture with a
pharmaceutically acceptable adjuvant. Suitable adjuvants include
(1) oil-in-water emulsion formulations such as MF59™, SAF™,
Ribi™; (2) Freund's complete or incomplete adjuvant; (3) salts
i.e. AlK(SO4)2, AlNa(SO4)2, AlNH4(SO4)2, Al(OH)3, AlPO4, silica,
kaolin; (4) saponin derivatives such as Stimulon™ or particles
generated therefrom such as ISCOMs (immunostimulating
complexes); (5) cytokines such as interleukins, interferons,
macrophage colony stimulating factor (M-CSF), tumor necrosis
factor (TNF); (6) other substances such as carbon
polynucleotides i.e. poly IC and poly AU, detoxified cholera
toxin (CTB) and E. coli heat labile toxin for induction of mucosal
immunity; and (7) liposomes. A more detailed description of
adjuvants is available in a review by M.Z.I. Khan et al. in
Pharmaceutical Research, vol. 11, No. 1 (1994) pp. 2-11, and also
in another review by Gupta et al., in Vaccine, Vol. 13, No. 14,
pp. 1263-1276 (1995) and in WO 99/24578. Preferred adjuvants
include QuilA™, QS21™, Alhydrogel™ and Adjuphos™.

Pharmaceutical compositions of the invention may be administered
parenterally by injection, rapid infusion, nasopharyngeal
absorption, dermoabsorption, or by the buccal or oral route.

The term pharmaceutical composition is also meant to include 
antibodies. In accordance with the present invention, there is 
also provided the use of one or more antibodies having binding 
specificity for the polypeptides of the present invention for 
the treatment or prophylaxis of streptococcus infection and/or
diseases and symptoms mediated by streptococcus infection. 

Pharmaceutical compositions of the invention are used for the
prophylaxis of S. pyogenes infection and/or diseases and
symptoms mediated by S. pyogenes infection as described in
Manual of Clinical Microbiology, P.R. Murray (Ed. in chief), E.J.
Baron, M.A. Pfaller, F.C. Tenover and R.H. Yolken. ASM Press,
Washington, D.C. seventh edition, 1999, 1773p.

In one embodiment, pharmaceutical compositions of the present
invention are used for the prophylactic or therapeutic treatment
of many diverse infection types, including pharyngitis,
erysipelas and impetigo, scarlet fever, and invasive diseases
such as bacteremia and necrotizing fasciitis.

In one embodiment, pharmaceutical compositions of the invention
are used for the prophylactic or therapeutic treatment of
Streptococcus infection and/or diseases and symptoms mediated by
Streptococcus infection, in particular group A Streptococcus
(Streptococcus pyogenes), group B Streptococcus (GBS or S.
agalactiae), S. pneumoniae, S. dysgalactiae, S. uberis, S.
nocardia as well as Staphylococcus aureus.

In one embodiment, pharmaceutical compositions of the invention
are used for the prophylactic or therapeutic treatment of S.
pyogenes infection and/or diseases and symptoms mediated by S.
pyogenes infection. In a further embodiment, the S. pyogenes
infection is nontypeable S. pyogenes.

Pharmaceutical compositions can also be specific or selective
for one or more of the mentioned bacteria. For example, a
composition can be selective or specific for a group A
Streptococcus (Streptococcus pyogenes) and not react with group
B Streptococcus (GBS or S. agalactiae), S. pneumoniae, S.
dysgalactiae, S. uberis, S. nocardia nor Staphylococcus aureus.

In a further embodiment, the invention provides a method for
prophylactic or therapeutic treatment of streptococcus infection
in a host susceptible to streptococcus infection comprising
administering to said host a prophylactic or therapeutic amount
of a composition of the invention.

In a further embodiment, the invention provides a method for
prophylactic or therapeutic treatment of S. pyogenes infection
in a host susceptible to S. pyogenes infection comprising
administering to said host a prophylactic or therapeutic amount
of a composition of the invention.

As used in the present application, the term "host" includes 
mammals. In a further embodiment, the mammal is human. 

In a particular embodiment, pharmaceutical compositions are
administered to those hosts at risk of S. pyogenes infection
such as neonates, infants, children, elderly and
immunocompromised hosts.

In a particular embodiment, pharmaceutical compositions are
administered to those hosts at risk of S. pyogenes infection
such as adults.

Pharmaceutical compositions are preferably in unit dosage form
of about 0.001 to 100 µg/kg (antigen/body weight) and more
preferably 0.01 to 10 µg/kg and most preferably 0.1 to 1 µg/kg,
1 to 3 times with an interval of about 1 to 6 weeks between
immunizations.

Pharmaceutical compositions are preferably in unit dosage form
of about 0.1 µg to 10 mg and more preferably 1 µg to 1 mg and
most preferably 10 to 100 µg, 1 to 3 times with an interval of
about 1 to 6 weeks between immunizations.

According to another aspect, there are provided polynucleotides
encoding polypeptides characterized by the amino acid sequence
comprising SEQ ID NO: 2, 4, 6, 22, 24, 26, 28, 30, 32, 34, 36,
38, 40, 42, 44 or fragments or analogs thereof.

In one embodiment, polynucleotides are those illustrated in SEQ
ID NO: 1, 3, 5, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43
which may include the open reading frames (ORF), encoding the
polypeptides of the invention.

Many types of variants of polynucleotides are encompassed by the
invention including, e.g., (i) one in which one or more of the
nucleotides is substituted with another nucleotide, or which is
otherwise mutated; or (ii) one in which one or more of the
nucleotides is modified, e.g., includes a substituent group; or
(iii) one in which the polynucleotide is fused with another
compound, such as a compound to increase the half-life of the
polynucleotide; or (iv) one in which additional nucleotides are
covalently bound to the polynucleotide, such as sequences encoding
a leader or secretory sequence or a sequence which is employed
for purification of the polypeptide. The additional nucleotides
may be from a heterologous source, or may be endogenous to the
natural gene.

Polynucleotide variants belonging to type (i) above include,
e.g., polymorphisms, including single nucleotide polymorphisms
(SNPs), and mutants. Variant polynucleotides can comprise, e.g.,
one or more additions, insertions, deletions, substitutions,
transitions, transversions, inversions, or the like, or any
combinations thereof.

Polynucleotide variants belonging to type (ii) above include,
e.g., modifications such as the attachment of detectable markers
(avidin, biotin, radioactive elements, fluorescent tags and dyes,
energy transfer labels, energy-emitting labels, binding partners,
etc.) or moieties which improve expression, uptake, cataloging,
tagging, hybridization, detection, and/or stability. The
polynucleotides can also be attached to solid supports, e.g.,
nitrocellulose, magnetic or paramagnetic microspheres (e.g., as
described in U.S. Pat. No. 5,411,863; U.S. Pat. No. 5,543,289;
for instance, comprising ferromagnetic, supermagnetic,
paramagnetic, superparamagnetic, iron oxide and polysaccharide),
nylon, agarose, diazotized cellulose, latex solid microspheres,
polyacrylamides, etc., according to a desired method. See, e.g.,
U.S. Pat. Nos. 5,470,967; 5,476,925; 5,478,893.

Polynucleotide variants belonging to type (iii) above are well
known in the art and include, e.g., various lengths of polyA+
tail, 5' cap structures, and nucleotide analogs, e.g., inosine,
thionucleotides, or the like.

Polynucleotide variants belonging to type (iv) above include,
e.g., a variety of chimeric, hybrid or fusion polynucleotides.
For example, a polynucleotide of the invention can comprise a
coding sequence and additional non-naturally occurring or
heterologous coding sequence (e.g., sequences coding for leader,
signal, secretory, targeting, enzymatic, fluorescent, antibiotic
resistance, and other functional or diagnostic peptides); or a
coding sequence and non-coding sequences, e.g., untranslated
sequences at either a 5' or 3' end, or dispersed in the coding
sequence.

It will be appreciated that the polynucleotide sequences
illustrated in the figures may be altered with degenerate codons
yet still encode the polypeptides of the invention. Accordingly
the present invention further provides polynucleotides which
hybridize to the polynucleotide sequences herein above described
(or the complement sequences thereof) having 70% identity
between sequences. In one embodiment, at least 80% identity
between sequences. In one embodiment, at least 85% identity
between sequences. In one embodiment, at least 90% identity
between sequences. In a further embodiment, polynucleotides are
hybridizable under stringent conditions i.e. having at least 95%
identity. In a further embodiment, more than 97% identity. In a
further embodiment, more than 98% identity. In a further
embodiment, more than 99% identity. In a further embodiment,
polynucleotides are hybridizable under stringent conditions.
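
The point about degenerate codons can be made concrete with a short Python sketch, given here as an illustration only. It builds the standard genetic code table (assumed here to apply to the coding sequences in question), translates two hypothetical 12-nucleotide sequences that differ only by synonymous codons, and computes their nucleotide identity. The sequences, function names and variable names are invented for the example and do not come from the figures.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of matching positions between two equal-length sequences."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must have the same non-zero length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# Hypothetical sequences that differ only by synonymous codons.
variant_a = "ATGAAAGAATTA"
variant_b = "ATGAAGGAGCTG"
print(translate(variant_a), translate(variant_b))
print(round(percent_identity(variant_a, variant_b), 1))
```

In this toy case both sequences encode the same peptide although a third of the nucleotide positions differ, which is why an identity threshold on the polynucleotide is stated separately from the identity of the encoded polypeptide.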

Suitable stringent conditions for hybridization can be readily
determined by one skilled in the art (see for example
Sambrook et al., (1989) Molecular Cloning: A Laboratory Manual,
2nd ed., Cold Spring Harbor, N.Y.; Current Protocols in Molecular
Biology, (1999) Edited by Ausubel F.M. et al., John Wiley &
Sons, Inc., N.Y.).

"Suitable stringent conditions", as used herein, means, for 
example, incubating a blot overnight (e.g., at least 12 hours)
with a long polynucleotide probe in a hybridization solution
containing, e.g., about 5X SSC, 0.5% SDS, 100 µg/ml denatured
salmon sperm DNA and 50% formamide, at 42°C. Blots can be
washed at high stringency conditions that allow, e.g., for less
than 5% bp mismatch (e.g., wash twice in 0.1X SSC and 0.1% SDS
for 30 min at 65°C), thereby selecting sequences having, e.g.,
95% or greater sequence identity.

Other non-limiting examples of suitable stringent conditions
include a final wash at 65°C in aqueous buffer containing 30 mM
NaCl and 0.5% SDS. Another example of suitable stringent
conditions is hybridization in 7% SDS, 0.5 M NaPO4, pH 7, 1 mM
EDTA at 50°C, e.g., overnight, followed by one or more washes
with a 1% SDS solution at 42°C. Whereas high stringency washes
can allow for less than 5% mismatch, reduced or low stringency
conditions can permit up to 20% nucleotide mismatch.
Hybridization at low stringency can be accomplished as above,
but using lower formamide conditions, lower temperatures and/or
lower salt concentrations, as well as longer periods of
incubation time.

According to another aspect of the invention, there are also
provided isolated polynucleotides comprising a sequence that
hybridizes under stringent conditions to either

(a) a DNA sequence encoding a polypeptide or
(b) the complement of a DNA sequence encoding a polypeptide;

wherein said polypeptide comprises SEQ ID NO: 2, 4, 6,
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44 or
fragments or analogs thereof.
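
For readers less familiar with the term, the complement in (b) can be derived mechanically from the coding strand. The Python sketch below is illustrative only: it returns the reverse complement of a DNA string written 5' to 3', using a hypothetical function name and a toy input; ambiguity codes and RNA are deliberately left out.

```python
# Watson-Crick pairing for unambiguous DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand of `dna`, written 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


# Toy input, for illustration only.
print(reverse_complement("ATGC"))
```

Applying the function twice returns the original strand, which is a convenient sanity check when preparing probe sequences.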


In a further embodiment, the present invention provides
polynucleotides that hybridize under stringent conditions to
either

(a) a DNA sequence encoding a polypeptide or
(b) the complement of a DNA sequence encoding a polypeptide;

wherein said polypeptide comprises SEQ ID NO: 2, 4, 6, 22, 24,
26, 28, 30, 32, 34, 36, 38, 40, 42 or 44.

According to another aspect of the invention, there are also
provided isolated polynucleotides comprising a sequence that
hybridizes under stringent conditions to either

(a) a DNA sequence encoding a polypeptide or
(b) the complement of a DNA sequence encoding a polypeptide;

wherein said polypeptide has SEQ ID NO: 2, 4, 6, 22,
24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44 or
fragments or analogs thereof.

In a further embodiment, the present invention provides
polynucleotides that hybridize under stringent conditions to
either

(a) a DNA sequence encoding a polypeptide or
(b) the complement of a DNA sequence encoding a polypeptide;

wherein said polypeptide has SEQ ID NO: 2, 4, 6, 22, 24, 26, 28,
30, 32, 34, 36, 38, 40, 42 or 44.

In a further embodiment, the present invention provides
polynucleotides that hybridize under stringent conditions to
either

(a) a DNA sequence encoding a polypeptide or
(b) the complement of a DNA sequence encoding a polypeptide;

wherein said polypeptide comprises at least 10 contiguous amino
acid residues from a polypeptide comprising SEQ ID NO: 2, 4, 6,
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44 or fragments or
analogs thereof.

In a further embodiment, the present invention provides 
polynucleotides that hybridize under stringent conditions to 
either 

(a) a DNA sequence encoding a polypeptide or 
25 (b) the complement of a DNA sequence encoding a 

polypeptide; 

wherein said polypeptide comprises at least 10 contiguous amino 
acid residues from a polypeptide comprising SEQ ID No : 2, 4, 6 
22, 24, 26, 28, 30, 32, 34, 36, 38,. 40, 42 or 44. 

In a further embodiment, the present invention provides
polynucleotides that hybridize under stringent conditions to
either

(a) a DNA sequence encoding a polypeptide or
(b) the complement of a DNA sequence encoding a polypeptide;

wherein said polypeptide has at least 10 contiguous amino acid
residues from a polypeptide having SEQ ID NO: 2, 4, 6, 22, 24,
26, 28, 30, 32, 34, 36, 38, 40, 42, 44 or fragments or analogs
thereof.

In a further embodiment, the present invention provides
polynucleotides that hybridize under stringent conditions to
either

(a) a DNA sequence encoding a polypeptide or
(b) the complement of a DNA sequence encoding a polypeptide;

wherein said polypeptide has at least 10 contiguous amino acid
residues from a polypeptide having SEQ ID NO: 2, 4, 6, 22, 24,
26, 28, 30, 32, 34, 36, 38, 40, 42 or 44.

In a further embodiment, polynucleotides are those illustrated
in SEQ ID NO: 1, 3, 5, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39,
41, 43 or fragments or analogs thereof encoding polypeptides of
the invention.

In a further embodiment, polynucleotides are those illustrated
in SEQ ID NO: 1, 3, 5, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39,
41 or 43 encoding polypeptides of the invention.

As will be readily appreciated by one skilled in the art, 
polynucleotides include both DNA and RNA. 

The present invention also includes polynucleotides
complementary to the polynucleotides described in the present
application.

In a further aspect, polynucleotides encoding polypeptides of
the invention, or fragments, analogs or derivatives thereof, may
be used in a DNA immunization method. That is, they can be
incorporated into a vector which is replicable and expressible
upon injection, thereby producing the antigenic polypeptide in
vivo. For example, polynucleotides may be incorporated into a
plasmid vector under the control of the CMV promoter, which is
functional in eukaryotic cells. Preferably the vector is
injected intramuscularly.

According to another aspect, there is provided a process for
producing polypeptides of the invention by recombinant
techniques by expressing a polynucleotide encoding said
polypeptide in a host cell and recovering the expressed
polypeptide product.

Alternatively, the polypeptides can be produced according to
established synthetic chemical techniques i.e. solution phase or
solid phase synthesis of oligopeptides which are ligated to
produce the full polypeptide (block ligation).

General methods for obtention and evaluation of polynucleotides
and polypeptides are described in the following references:
Sambrook et al, Molecular Cloning: A Laboratory Manual, 2nd ed,
Cold Spring Harbor, N.Y., 1989; Current Protocols in Molecular
Biology, Edited by Ausubel F.M. et al., John Wiley and Sons,
Inc. New York; PCR Cloning Protocols, from Molecular Cloning to
Genetic Engineering, Edited by White B.A., Humana Press, Totowa,
New Jersey, 1997, 490 pages; Protein Purification, Principles
and Practices, Scopes R.K., Springer-Verlag, New York, 3rd
Edition, 1993, 380 pages; Current Protocols in Immunology,
Edited by Coligan J.E. et al., John Wiley & Sons Inc., New York.

The present invention provides a vector comprising a
polynucleotide of the invention, wherein said DNA is operably
linked to an expression control region.

The present invention provides a host cell transfected with the
vector of the invention.

The present invention provides a process for producing a
polypeptide comprising culturing a host cell of the invention
under conditions suitable for expression of said polypeptide.

For recombinant production, host cells are transfected with
vectors which encode the polypeptides of the invention, and then
cultured in a nutrient medium modified as appropriate for
activating promoters, selecting transformants or amplifying the
genes. Suitable vectors are those that are viable and replicable
in the chosen host and include chromosomal, non-chromosomal and
synthetic DNA sequences e.g. bacterial plasmids, phage DNA,
baculovirus, yeast plasmids, vectors derived from combinations
of plasmids and phage DNA. The sequence encoding the polypeptide
may be incorporated in the vector at the appropriate site using
restriction enzymes such that it is operably linked to an
expression control region comprising a promoter, ribosome
binding site (consensus region or Shine-Dalgarno sequence), and
optionally an operator (control element). One can select
individual components of the expression control region that are
appropriate for a given host and vector according to established
molecular biology principles (Sambrook et al, Molecular Cloning:
A Laboratory Manual, 2nd ed, Cold Spring Harbor, N.Y., 1989;
Current Protocols in Molecular Biology, Edited by Ausubel F.M.
et al., John Wiley and Sons, Inc. New York). Suitable promoters
include but are not limited to LTR or SV40 promoter, E. coli lac,
tac or trp promoters and the phage lambda PL promoter. Vectors
will preferably incorporate an origin of replication as well as
selection markers i.e. ampicillin resistance gene. Suitable
bacterial vectors include pET, pQE70, pQE60, pQE-9, pD10,
phagescript, psiX174, pbluescript SK, pbsks, pNH6A, pNH16a,
pNH18A, pNH46A, ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 and
eukaryotic vectors pBlueBacIII, pWLNEO, pSV2CAT, pOG44, pXT1,
pSG, pSVK3, pBPV, pMSG and pSVL. Host cells may be bacterial
i.e. E. coli, Bacillus subtilis, Streptomyces; fungal i.e.
Aspergillus niger, Aspergillus nidulans; yeast i.e.
Saccharomyces or eukaryotic i.e. CHO, COS.

Upon expression of the polypeptide in culture, cells are
typically harvested by centrifugation then disrupted by physical
or chemical means (if the expressed polypeptide is not secreted
into the media) and the resulting crude extract retained to
isolate the polypeptide of interest. Purification of the
polypeptide from culture media or lysate may be achieved by
established techniques depending on the properties of the
polypeptide i.e. using ammonium sulfate or ethanol
precipitation, acid extraction, anion or cation exchange
chromatography, phosphocellulose chromatography, hydrophobic
interaction chromatography, hydroxylapatite chromatography and
lectin chromatography. Final purification may be achieved using
HPLC.

The polypeptides may be expressed with or without a leader or
secretion sequence. In the former case the leader may be
removed using post-translational processing (see US 4,431,739;
US 4,425,437; and US 4,338,397) or be chemically removed
subsequent to purifying the expressed polypeptide.

According to a further aspect, the S. pyogenes polypeptides of
the invention may be used in a diagnostic test for Streptococcus
infection, in particular S. pyogenes infection.

Several diagnostic methods are possible. For example, to detect
the S. pyogenes organism in a biological sample, the following
procedure may be followed:

a) obtaining a biological sample from a host;
b) incubating an antibody or fragment thereof reactive with an
S. pyogenes polypeptide of the invention with the biological
sample to form a mixture; and
c) detecting specifically bound antibody or bound fragment in
the mixture which indicates the presence of S. pyogenes.

Alternatively, a method for the detection of antibody specific
to an S. pyogenes antigen in a biological sample containing or
suspected of containing said antibody may be performed as
follows:

a) obtaining a biological sample from a host;
b) incubating one or more S. pyogenes polypeptides of the
invention or fragments thereof with the biological sample to
form a mixture; and
c) detecting specifically bound antigen or bound fragment in
the mixture which indicates the presence of antibody specific
to S. pyogenes.

One of skill in the art will recognize that this diagnostic test
may take several forms, including an immunological test such as
an enzyme-linked immunosorbent assay (ELISA), a radioimmunoassay
or a latex agglutination assay, essentially to determine whether
antibodies specific for the polypeptide are present in an
organism.

The DNA sequences encoding polypeptides of the invention may
also be used to design DNA probes for use in detecting the
presence of S. pyogenes in a biological sample suspected of
containing such bacteria. The detection method of this
invention comprises:

a) obtaining the biological sample from a host;
b) incubating one or more DNA probes having a DNA sequence
encoding a polypeptide of the invention or fragments thereof
with the biological sample to form a mixture; and
c) detecting specifically bound DNA probe in the mixture
which indicates the presence of S. pyogenes bacteria.

The DNA probes of this invention may also be used for detecting
circulating S. pyogenes i.e. S. pyogenes nucleic acids in a
sample, for example using a polymerase chain reaction, as a
method of diagnosing S. pyogenes infections. The probe may be
synthesized using conventional techniques and may be immobilized
on a solid phase, or may be labelled with a detectable label. A
preferred DNA probe for this application is an oligomer having a
sequence complementary to at least about 6 contiguous
nucleotides of the S. pyogenes polypeptides of the invention. In
a further embodiment, the preferred DNA probe will be an
oligomer having a sequence complementary to at least about 15
contiguous nucleotides of the S. pyogenes polypeptides of the
invention. In a further embodiment, the preferred DNA probe will
be an oligomer having a sequence complementary to at least about
30 contiguous nucleotides of the S. pyogenes polypeptides of the
invention. In a further embodiment, the preferred DNA probe will
be an oligomer having a sequence complementary to at least about
50 contiguous nucleotides of the S. pyogenes polypeptides of the
invention.
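
The probe lengths discussed above can be illustrated with a short
Python sketch that enumerates every oligomer of a given length
complementary to a contiguous stretch of a target strand. This is
only a minimal sketch under stated assumptions: the function names
reverse_complement and candidate_probes are hypothetical, the
target is a short placeholder (the sequence of primer DMAR2174 from
Table 1) rather than any SEQ ID NO of the invention, and neither
hybridization stringency nor probe specificity is modelled.

```python
# Watson-Crick base pairing for unambiguous DNA bases.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, read 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def candidate_probes(target: str, length: int) -> list[str]:
    """List every oligomer of the given length that is complementary
    to a contiguous stretch of the target strand."""
    if length > len(target):
        return []  # the target is too short for a probe of this length
    return [
        reverse_complement(target[i:i + length])
        for i in range(len(target) - length + 1)
    ]


# Placeholder target: hypothetical use of primer DMAR2174 (Table 1).
target = "GAGAAAATACATATGTCACGTATTGGTAATAAAG"

for n in (6, 15, 30, 50):
    print(n, len(candidate_probes(target, n)))
```

A real probe design would additionally screen each candidate
against non-target genomes; that step is outside this sketch.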

Another diagnostic method for the detection of S. pyogenes in a
host comprises:

a) labelling an antibody reactive with a polypeptide of the
invention or fragment thereof with a detectable label;
b) administering the labelled antibody or labelled fragment to
the host; and
c) detecting specifically bound labelled antibody or labelled
fragment in the host which indicates the presence of S.
pyogenes.

A further aspect of the invention is the use of the S. pyogenes
polypeptides of the invention as immunogens for the production
of specific antibodies for the diagnosis and in particular the
treatment of S. pyogenes infection. Suitable antibodies may be
determined using appropriate screening methods, for example by
measuring the ability of a particular antibody to passively
protect against S. pyogenes infection in a test model. One
example of an animal model is the mouse model described in the
examples herein. The antibody may be a whole antibody or an
antigen-binding fragment thereof and may belong to any
immunoglobulin class. The antibody or fragment may be of animal
origin, specifically of mammalian origin and more specifically
of murine, rat or human origin. It may be a natural antibody or
a fragment thereof, or if desired, a recombinant antibody or
antibody fragment. The term recombinant antibody or antibody
fragment means antibody or antibody fragment which was produced
using molecular biology techniques. The antibody or antibody
fragments may be polyclonal, or preferably monoclonal. It may
be specific for a number of epitopes associated with the S.
pyogenes polypeptides but is preferably specific for one.

According to one aspect, the present invention provides the use
of an antibody for treatment and/or prophylaxis of S. pyogenes
infections.

A further aspect of the invention is the use of the antibodies
directed to the polypeptides of the invention for passive
immunization. One could use the antibodies described in the
present application.

A further aspect of the invention is a method for immunization,
whereby an antibody raised by a polypeptide of the invention is
administered to a host in an amount sufficient to provide a
passive immunization.

The use of a polynucleotide of the invention in genetic
immunization will preferably employ a suitable delivery method
or system such as direct injection of plasmid DNA into muscles
[Wolf et al., HMG (1992) 1: 363; Turnes et al., Vaccine (1999)
17: 2089; Le et al., Vaccine (2000) 18: 1893; Alves et al.,
Vaccine (2001) 19: 788], injection of plasmid DNA with or
without adjuvants [Ulmer et al., Vaccine (1999) 18: 18;
MacLaughlin et al., J. Control Release (1998) 56: 259; Hartikka
et al., Gene Ther. (2000) 7: 1171-82; Benvenisty and Reshef,
PNAS USA (1986) 83: 9551; Singh et al., PNAS USA (2000) 97: 811],
targeting cells by delivery of DNA complexed with specific
carriers [Wu et al., J Biol Chem (1989) 264: 16985; Chaplin et
al., Infect. Immun. (1999) 67: 6434], injection of plasmid
complexed or encapsulated in various forms of liposomes [Ishii
et al., AIDS Research and Human Retroviruses (1997) 13: 142;
Perrie et al., Vaccine (2001) 19: 3301], administration of DNA
with different methods of bombardment [Tang et al., Nature
(1992) 356: 152; Eisenbraun et al., DNA Cell Biol (1993) 12:
791; Chen et al., Vaccine (2001) 19: 2908], and administration
of DNA with live vectors [Tubulekas et al., Gene (1997) 190:
191; Pushko et al., Virology (1997) 239: 389; Spreng et al., FEMS
(2000) 27: 299; Dietrich et al., Vaccine (2001) 19: 2506].

In a further embodiment, the invention provides the use of a
pharmaceutical composition of the invention in the manufacture
of a medicament for the prophylactic or therapeutic treatment of
S. pyogenes infection.

In a further embodiment, the invention provides a kit comprising
a polypeptide of the invention for detection or diagnosis of S.
pyogenes infection.

Unless otherwise defined, all technical and scientific terms
used herein have the same meaning as commonly understood by one
of ordinary skill in the art to which this invention belongs.
All publications, patent applications, patents, and other
references mentioned herein are incorporated by reference in
their entirety. In case of conflict, the present specification,
including definitions, will control. In addition, the
materials, methods, and examples are illustrative only and not
intended to be limiting.

EXAMPLE 1 

This example illustrates the cloning and molecular
characteristics of SHB-GAS-102 gene and corresponding
polypeptide.

The coding region of S. pyogenes SHB-GAS-102 (SEQ ID NO: 1) gene
was amplified by PCR (Robocycler Gradient 96 Temperature cycler,
Stratagene, La Jolla, CA) from genomic DNA of serotype M1 S.
pyogenes strain ATCC700294 using the following oligonucleotide
primers that contained base extensions for the addition of
restriction sites NdeI (CATATG) and XhoI (CTCGAG): DMAR2174 and
DMAR2175, which are presented in Table 1. PCR products were
purified from agarose gel using a QIAquick gel extraction kit
from QIAgen (Chatsworth, CA) following the manufacturer's
instructions, and digested with NdeI and XhoI (Amersham
Biosciences Inc, Baie d'Urfe, Canada). The pET-19b(+) vector
(Novagen, Madison, WI) was digested with NdeI and XhoI and
purified from agarose gel using a QIAquick gel extraction kit
from QIAgen (Chatsworth, CA). The NdeI-XhoI PCR products were
ligated to the NdeI-XhoI pET-19b(+) expression vector. The
ligated products were transformed into E. coli strain DH5α
[φ80dlacZΔM15 Δ(lacZYA-argF)U169 endA1 recA1 hsdR17(rK-mK+) deoR
thi-1 supE44 λ-gyrA96 relA1] (Gibco BRL, Gaithersburg, MD)
according to the method of Simanis (Hanahan, D. DNA Cloning,
1985, D.M. Glover (ed), pp. 109-135). Recombinant pET-19b(+)
plasmid (rpET19b(+)) containing SHB-GAS-102 gene was purified
using the GenElute plasmid kit (Sigma-Aldrich Company Ltd, MO)
and DNA insert was sequenced (Taq Dye Deoxy Terminator Cycle
Sequencing kit, ABI, Foster City, CA).

Table 1. Oligonucleotide primers used for PCR amplifications of
S. pyogenes genes

Genes        Primers      Restriction  Vector   Sequence                                      DNA modification(1)               SEQ ID NO
             I.D.         site
SHB-GAS-102  DMAR2174     NdeI         pET19b   5'-GAGAAAATACATATGTCACGTATTGGTAATAAAG-3'      None                              7
             DMAR2175     XhoI         pET19b   5'-CCCTCGAGTTATTTACCTGTTTTACCTTC-3'           None
             DMAR2174a    BamHI        pCMV-GH  5'-AAGGATCCCATGTCACGTATTGGTAATAAAG-3'         None                              9
             DMAR2175a    SalI         pCMV-GH  5'-ACTAGTCGACTTATTTACCTGTTTTACCTTCTTTAAGG-3'  None                              10
             DMAR2841(2)  n.a.         n.a.     5'-CCTTACAAAGGCAAAGGCATCCGTTACGTTGGTGA-3'     466-CCTTACAAAGGCAAAGGCATCCGT-489  11
             DMAR2842(2)  n.a.         n.a.     5'-TCACCAACGTAACGGATGCCTTTGCCTTTGTAAGG-3'                                       12
SHB-GAS-103  DMAR1878     NdeI         pET19b   5'-TGTGTGGTTCATATGAGCTACTTGATAATG-3'          None                              13
             DMAR1879     XhoI         pET19b   5'-CCCTCGAGTTAAGGTTTAACAATACTTCC-3'           None                              14
             DMAR1878a    BamHI        pCMV-GH  5'-GGGGATCCCTTGATAATGAACCATCAAC-3'            None                              15
             DMAR1879a    SalI         pCMV-GH  5'-CCGTCGACGGTTTAACAATACTTCCTAC-3'            None                              16
SHB-GAS-104  DMAR1976     NdeI         pET19b   5'-CTTTTTGGTACATATGGTGAATCAGCACCCTAA-3'       None                              17
             DMAR1977     XhoI         pET19b   5'-CCCTCGAGTTACGGATGATCTCCCAC-3'              None                              18
             DMAR1976a    BamHI        pCMV-GH  5'-[...]CACCCTAAAACGG-3'                      None                              19
             DMAR1977a    SalI         pCMV-GH  5'-CCGTCGACGGATGATCTCCCACGTGGTC-3'            None                              20

1 The underlined nucleotide represents the modification in the
DNA sequence.

2 PCR oligonucleotide primer sets used to remove the BamHI
restriction site present in the SHB-GAS-102 gene.
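
As a sanity check on Table 1, the following Python sketch looks
for each stated recognition sequence inside the corresponding
primer. It is a minimal sketch, not part of the original
procedure: the function name site_position is hypothetical, only
the SHB-GAS-102 primers are shown, and the "a" suffix on the
pCMV-GH primer names reflects our reading of the table layout.

```python
# Recognition sequences as given in the text of the Examples.
SITES = {
    "NdeI": "CATATG",
    "XhoI": "CTCGAG",
    "BamHI": "GGATCC",
    "SalI": "GTCGAC",
}

# Primer sequences transcribed from Table 1 (SHB-GAS-102 rows only).
PRIMERS = {
    "DMAR2174": ("NdeI", "GAGAAAATACATATGTCACGTATTGGTAATAAAG"),
    "DMAR2175": ("XhoI", "CCCTCGAGTTATTTACCTGTTTTACCTTC"),
    "DMAR2174a": ("BamHI", "AAGGATCCCATGTCACGTATTGGTAATAAAG"),
    "DMAR2175a": ("SalI", "ACTAGTCGACTTATTTACCTGTTTTACCTTCTTTAAGG"),
}


def site_position(primer: str, enzyme: str) -> int:
    """Return the 0-based index of the enzyme's recognition sequence
    in the primer, or -1 if the site is absent."""
    return primer.find(SITES[enzyme])


for name, (enzyme, seq) in PRIMERS.items():
    print(name, enzyme, site_position(seq, enzyme))
```

A result of -1 for any primer would point to a transcription
error in the sequence or in the restriction-site column.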

It was determined that the 537-bp open reading frame (ORF) of
SHB-GAS-102, which includes a stop codon (TAA), encodes a
178-amino-acid-residue polypeptide with a predicted pI of 9.55
and a predicted molecular mass of 19,431.0 Da. Analysis of the
predicted amino acid residues sequence (SEQ ID NO: 2) using the
SPScan software (Wisconsin Sequence Analysis Package; Genetics
Computer Group) did not reveal the existence of a signal peptide.
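
The relationship between ORF length and polypeptide length stated
above can be checked with simple arithmetic: each codon is three
base pairs, and the stop codon encodes no residue. The Python
sketch below is illustrative only; the function name
residues_from_orf is hypothetical, and the ORF lengths are those
reported in Examples 1 to 3.

```python
def residues_from_orf(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acid residues encoded by an ORF of orf_bp
    base pairs, optionally discounting a terminal stop codon."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


# ORF lengths (bp, stop codon included) as reported in the Examples.
for gene, bp in (("SHB-GAS-102", 537),
                 ("SHB-GAS-103", 1269),
                 ("SHB-GAS-104", 885)):
    print(gene, residues_from_orf(bp))
```

For the 537-bp ORF this gives 537 / 3 - 1 = 178 residues, in
agreement with the text.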

To confirm the presence by PCR amplification of SHB-GAS-102 (SEQ
ID NO: 1) gene, the following 4 serologically distinct S.
pyogenes strains were used: the serotype M1 S. pyogenes strain
ATCC700294, the serotype M3 S. pyogenes strain ATCC12384, and
the serotype M18 S. pyogenes strain ATCC12357 were obtained from
the American Type Culture Collection (Rockville, MD), and the
serotype M6 S. pyogenes SPY67 clinical isolate was provided by
the Centre de recherche en infectiologie du Centre hospitalier
de l'Université Laval, Sainte-Foy, Canada. Chromosomal DNA was
isolated from each strain as previously described (Jayarao BM et
al. 1991. J. Clin. Microbiol. 29:2774-2778). SHB-GAS-102 (SEQ ID
NO: 1) gene was amplified by PCR (Robocycler Gradient 96
Temperature cycler) from the genomic DNA purified from the 4 S.
pyogenes strains using the oligonucleotide primers DMAR2174 and
DMAR2175 (Table 1). PCR was performed with 35 cycles of 45 sec
at 95°C, 45 sec at 45°C and 75 sec at 72°C and a final
elongation period of 10 min at 72°C. The PCR products were size
fractionated in 1% agarose gels and were visualized by ethidium
bromide staining. The results of these PCR amplifications are
presented in Table 2. The analysis of the amplification
products revealed that SHB-GAS-102 (SEQ ID NO: 1) gene was
present in the genome of all of the 4 S. pyogenes strains
tested. These PCR data presented in the previous paragraphs
clearly demonstrated that the SHB-GAS-102 gene is highly
conserved among streptococcal strains.
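
The cycling program described above can be written down as data,
which makes the total programmed hold time easy to compute. The
Python sketch below is a hedged illustration: the step labels
(denaturation, annealing, extension) and the names Step and
hold_time_seconds are our assumptions, and the figure excludes
ramp times between temperatures, which depend on the instrument.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    label: str      # descriptive label (our assumption, not in the text)
    temp_c: int     # block temperature in degrees Celsius
    seconds: int    # programmed hold time


# One cycle as described in Example 1.
CYCLE = [
    Step("denaturation", 95, 45),
    Step("annealing", 45, 45),
    Step("extension", 72, 75),
]
N_CYCLES = 35
FINAL_ELONGATION = Step("final elongation", 72, 10 * 60)


def hold_time_seconds(cycle, n_cycles, final):
    """Total programmed hold time, ignoring temperature ramps."""
    return n_cycles * sum(step.seconds for step in cycle) + final.seconds


total = hold_time_seconds(CYCLE, N_CYCLES, FINAL_ELONGATION)
print(f"{total} s = {total / 60:.2f} min")
```

With these values the programmed holds sum to 35 x 165 s + 600 s
= 6375 s, or about 106 minutes.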



Table 2. Identification of S. pyogenes genes by PCR
amplification

Strain              Identification by PCR amplification of
identification      SHB-GAS-102   SHB-GAS-103   SHB-GAS-104
ATCC700294 (M1)     +             +             +
ATCC12384 (M3)      +             +             +
SPY67 (M6)          +             +             +
ATCC12357 (M18)     +             +             +



In order to evaluate the distribution of the SHB-GAS-102
polypeptide among S. pyogenes isolates, the reactivity of pooled
mouse anti-SHB-GAS-102 sera to a collection of 13 strains of S.
pyogenes representing 13 M serotypes (Table 3) was tested by
immunoblots. These sera were collected from mice immunized three
times at two-week intervals with 20 µg of purified recombinant
SHB-GAS-102 polypeptides mixed with Quil A adjuvant. The
recombinant SHB-GAS-102 polypeptide was produced and purified as
described in Example 6. The S. pyogenes cells used for this
study were prepared following this protocol: bacteria were grown
in Todd Hewitt (TH) broth with 0.5% yeast extract and 1% peptone
extract overnight at 37°C in an 8% CO2 atmosphere. The bacterial
suspension was adjusted to an OD600nm of 1.0 and the S. pyogenes
cells were fixed with 0.3% formaldehyde in PBS buffer for 18 h
at 4°C. After incubation, this solution was treated with
mutanolysin (7.5 U/500 µl) for 30 min at 37°C, and boiled for 5
min in SDS-PAGE sample buffer. Antibodies present in the pooled
mouse sera recognized the SHB-GAS-102 polypeptide band (≈22 kDa)
in all the 13 S. pyogenes preparations tested (Table 3). In
order to confirm the nature of the polypeptide band, anti-SHB-
GAS-102 sera were adsorbed with 12.5 µg of purified recombinant
SHB-GAS-102 polypeptides (overnight, 4°C). As expected, these
adsorbed sera did not recognize, in the S. pyogenes whole-cell
extract applied onto the gel, the polypeptide band at ≈22 kDa,
confirming that this polypeptide band was the SHB-GAS-102
polypeptide.

These data as well as the data presented in the previous
paragraphs clearly demonstrated that the SHB-GAS-102 gene and
the SHB-GAS-102 polypeptide are highly conserved among
streptococcal strains.

WO 2005/017093                                PCT/CA2004/001510

Table 3. Identification of S. pyogenes polypeptides by
immunoblots with pooled mouse anti-sera raised against
recombinant SHB-GAS-102, SHB-GAS-103 and SHB-GAS-104
polypeptides(1).

Strain              Identification by immunoblot of
identification
(M serotype)        SHB-GAS-102   SHB-GAS-103   SHB-GAS-104
ATCC700294 (M1)     +             +             +
Spy68 (M2)          +             +             +
ATCC12384 (M3)      +             +             +
Spy71 (M4)          +             +             +
Spy70 (M5)          +             +             +
Spy67 (M6)          +             +             +
Spy88 (M11)         +             +             +
Spy95 (M12)         +             +             +
ATCC12357 (M18)     +             +             +
Spy73 (M22)         +             +             +
Spy91 (M28)         +             +             +
Spy99 (M58)         +             +             +
Spy87 (M77)         +             +             +

1 Sera were collected from mice immunized three times at two-week
intervals with 20 µg of purified recombinant polypeptides mixed
with Quil A adjuvant. Sera were diluted 1/1000 to perform the
immunoblots.

EXAMPLE 2 

This example illustrates the cloning and molecular
characteristics of SHB-GAS-103 gene and corresponding
polypeptide.

The coding region of S. pyogenes SHB-GAS-103 gene (SEQ ID NO: 3)
was amplified by PCR (Robocycler Gradient 96 Temperature cycler,
Stratagene, La Jolla, CA) from genomic DNA of serotype M1 S.
pyogenes strain ATCC700294 using the following oligos that
contained base extensions for the addition of restriction sites
NdeI (CATATG) and XhoI (CTCGAG): DMAR1878 and DMAR1879, which
are presented in Table 1. The methods used for cloning SHB-GAS-
103 into an expression vector and sequencing are similar to the
methods described in Example 1.

It was determined that the 1269-bp open reading frame (ORF) of
SHB-GAS-103, which includes a stop codon (TAA), encodes a
422-amino-acid-residue polypeptide with a predicted pI of 9.11
and a predicted molecular mass of 46,605.3 Da. Analysis of the
predicted amino acid residues sequence (SEQ ID NO: 4) using the
SPScan software (Wisconsin Sequence Analysis Package; Genetics
Computer Group) suggested the existence of a 27 amino acid
residues signal peptide (MFQLRKKMTRKQLALLSAGVLTCWGG).

The SHB-GAS-103 gene was shown to be present after PCR
amplification using the oligonucleotide primers DMAR1878 and
DMAR1879 in the 4 serologically distinct S. pyogenes strains
tested (Table 2). In addition, the SHB-GAS-103 polypeptide was
shown to be present in all the 13 S. pyogenes isolates tested by
immunoblots as described in Example 1 (Table 3). Indeed,
antibodies present in the pooled mouse sera recognized the SHB-
GAS-103 polypeptide band (≈52 kDa) in all the 13 S. pyogenes
preparations tested (Table 3). In order to confirm the nature of
the polypeptide band, anti-SHB-GAS-103 sera were adsorbed with
12.5 µg of purified recombinant SHB-GAS-103 polypeptides
(overnight, 4°C). As expected, these adsorbed sera did not
recognize, in the S. pyogenes whole-cell extract applied onto
the gel, the polypeptide band at ≈52 kDa, confirming that this
polypeptide band was the SHB-GAS-103 polypeptide.

These data presented in the previous paragraphs clearly
demonstrated that the SHB-GAS-103 gene and the SHB-GAS-103
polypeptide are highly conserved among streptococcal strains.

EXAMPLE 3

This example illustrates the cloning and molecular
characteristics of SHB-GAS-104 gene and corresponding
polypeptide.

The coding region of S. pyogenes SHB-GAS-104 gene (SEQ ID NO: 5)
was amplified by PCR (Robocycler Gradient 96 Temperature cycler,
Stratagene, La Jolla, CA) from genomic DNA of serotype M1 S.
pyogenes strain ATCC700294 using the following oligos that
contained base extensions for the addition of restriction sites
NdeI (CATATG) and XhoI (CTCGAG): DMAR1976 and DMAR1977, which
are presented in Table 1. The methods used for cloning SHB-GAS-
104 into an expression vector and sequencing are similar to the
methods described in Example 1.

It was determined that the 885-bp open reading frame (ORF) of
SHB-GAS-104, which includes a stop codon (TAA), encodes a
294-amino-acid-residue polypeptide with a predicted pI of 5.83
and a predicted molecular mass of 33,381.9 Da. Analysis of the
predicted amino acid residues sequence (SEQ ID NO: 6) using the
SPScan software (Wisconsin Sequence Analysis Package; Genetics
Computer Group) suggested the existence of a 19 amino acid
residues signal peptide (MIKRCKGIGLALMAFFLVA).

The SHB-GAS-104 gene was shown to be present after PCR
amplification using the oligonucleotide primers DMAR1976 and
DMAR1977 in the 4 serologically distinct S. pyogenes strains
tested (Table 2). The methods used for PCR amplification of the
SHB-GAS-104 gene were similar to the methods presented in
Example 1.

In addition, the distribution of the SHB-GAS-104 polypeptide
among S. pyogenes isolates was evaluated as described in
Example 1 using an S. pyogenes extract prepared by the following
method: bacteria were grown in TH broth with 0.5% yeast extract
and 1% peptone extract overnight at 37°C in an 8% CO2 atmosphere.
The bacterial suspension was adjusted to an OD600nm of 0.8. After
centrifugation, the bacterial pellet was resuspended in 500 µl
of extraction buffer (0.1 M Tris-HCl pH 7.6) and pelleted by
centrifugation. The pellet was frozen for 10 min at -80°C and
thawed at 37°C for 5 min. The freeze-thaw cycle was repeated
three times. The bacterial pellet was resuspended in 500 µl of
extraction buffer and sonicated 8 x 15 sec. Samples were
centrifuged (14 000 x g) at 4°C for 15 min and the supernatant
was boiled for 5 min in SDS-PAGE sample buffer. Antibodies
present in the pooled mouse anti-SHB-GAS-104 sera recognized the
SHB-GAS-104 polypeptide band (≈37.5 kDa) in all of the 13 S.
pyogenes preparations tested (Table 3). In order to confirm the
nature of the polypeptide band, anti-SHB-GAS-104 sera were
adsorbed with 12.5 µg of purified recombinant SHB-GAS-104
polypeptide (overnight, 4°C). As expected, this adsorbed serum
did not recognize, in the S. pyogenes extract applied onto the
gel, the polypeptide band at ≈37.5 kDa, confirming that this
polypeptide band was the SHB-GAS-104 polypeptide.

These PCR data presented in the previous paragraphs clearly
demonstrated that the SHB-GAS-104 gene is highly conserved among
streptococcal strains.

EXAMPLE 4

This example illustrates the cloning of S. pyogenes genes in CMV
plasmid pCMV-GH.

The DNA coding regions of S. pyogenes polypeptides were inserted
in phase downstream of a human growth hormone (hGH) gene which
was under the transcriptional control of the cytomegalovirus
(CMV) promotor in the plasmid vector pCMV-GH (Tang et al.,
Nature, 1992, 356: 152). The CMV promotor is non-functional in
E. coli cells but active upon administration of the plasmid in
eukaryotic cells. The vector also incorporated the ampicillin
resistance gene.

In order to remove the BamHI site within the SHB-GAS-102 gene,
mutagenesis experiments using the Quickchange Site-Directed
Mutagenesis kit from Stratagene were performed according to the
manufacturer's recommendations. The oligonucleotides DMAR2841
and DMAR2842 (Table 1) and the SHB-GAS-102 gene cloned into pET
vector as DNA template were used to perform the mutagenesis
experiments. The modification on the SHB-GAS-102 gene generated
by site-directed mutagenesis is presented in Table 1.
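
Two properties of the mutagenic primer pair can be checked
directly from the sequences in Table 1: the two oligonucleotides
should be reverse complements of each other, and neither should
contain the BamHI recognition sequence (GGATCC) they are meant to
remove. The Python sketch below is an illustration under stated
assumptions; the helper name reverse_complement is hypothetical,
and the check says nothing about whether the change is silent at
the protein level.

```python
# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# Mutagenic primers as transcribed from Table 1.
DMAR2841 = "CCTTACAAAGGCAAAGGCATCCGTTACGTTGGTGA"
DMAR2842 = "TCACCAACGTAACGGATGCCTTTGCCTTTGTAAGG"
BAMHI_SITE = "GGATCC"

# The pair should anneal to opposite strands of the same region.
pair_is_complementary = reverse_complement(DMAR2841) == DMAR2842

# The mutated region should no longer carry the BamHI site.
site_absent = BAMHI_SITE not in DMAR2841 and BAMHI_SITE not in DMAR2842

print(pair_is_complementary, site_absent)
```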

The coding regions of the modified SHB-GAS-102, SHB-GAS-103 (SEQ ID
NO: 3), and SHB-GAS-104 (SEQ ID NO: 5) genes were amplified by
PCR (Robocycler Gradient 96 Temperature cycler, Stratagene,
La Jolla, CA) from pET vector containing the modified SHB-GAS-102
gene or the genomic DNA of serotype M1 S. pyogenes strain
ATCC700294 (for SHB-GAS-103 and SHB-GAS-104 genes) using
oligonucleotide primers that contained base extensions for the
addition of restriction sites BamHI (GGATCC) and SalI (GTCGAC)
which are described in Table 1. The PCR products were purified
from agarose gel using a QIAquick gel extraction kit from QIAgen
(Chatsworth, CA) and digested with restriction enzymes (Amersham
Biosciences Inc, Baie d'Urfe, Canada). The pCMV-GH vector
(Laboratory of Dr. Stephen A. Johnston, Department of
Biochemistry, The University of Texas, Dallas, Texas) was
digested with BamHI and SalI and purified from agarose gel using
the QIAquick gel extraction kit from QIAgen (Chatsworth, CA).
The BamHI-SalI DNA fragments were ligated to the BamHI-SalI
pCMV-GH vector to create the hGH-SHB-GAS-102, hGH-SHB-GAS-103,
and hGH-SHB-GAS-104 fusion polypeptides under the control of the
CMV promoter. The ligated products were transformed into E. coli
strain DH5α [φ80dlacZΔM15 Δ(lacZYA-argF)U169 endA1 recA1
hsdR17(rK-mK+) deoR thi-1 supE44 λ- gyrA96 relA1] (Gibco BRL,
Gaithersburg, MD) according to the method of Simanis (Hanahan,
D. DNA Cloning, 1985, D.M. Glover (ed), pp. 109-135). The
recombinant pCMV plasmids were purified using the GenElute
plasmid kit (Sigma-Aldrich Company Ltd, MO) and the nucleotide
sequences of the DNA inserts were verified by DNA sequencing.
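Because the inserts are cloned through BamHI and SalI sites, an internal copy of either recognition sequence in a coding region would presumably be cut during digestion, which is consistent with the removal of the BamHI site from the SHB-GAS-102 gene before cloning. The following is a minimal Python sketch of such a check, not the procedure used by the inventors. The function name `find_internal_sites` is hypothetical, and the test fragment is simply the start of the last row of SEQ ID NO: 1 as printed in Figure 1.

```python
# Recognition sequences named in Example 4. Both are palindromic,
# so scanning one strand is sufficient.
RESTRICTION_SITES = {"BamHI": "GGATCC", "SalI": "GTCGAC"}


def find_internal_sites(seq, sites=RESTRICTION_SITES):
    """Return {enzyme: [0-based start positions]} for every site found.

    Whitespace (such as the ten-base block spacing used in the figures)
    is ignored and the search is case-insensitive.
    """
    cleaned = "".join(seq.split()).upper()
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = cleaned.find(motif)
        while start != -1:
            positions.append(start)
            start = cleaned.find(motif, start + 1)
        hits[enzyme] = positions
    return hits


if __name__ == "__main__":
    # Start of row 481 of SEQ ID NO: 1 (Figure 1).
    fragment = "gggatccgtt acgttggtga"
    print(find_internal_sites(fragment))
```

Run on this fragment, the sketch reports one BamHI site and no SalI site, which matches the need to alter the SHB-GAS-102 sequence before BamHI/SalI cloning.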

EXAMPLE 5

This example illustrates the use of DNA to elicit an immune
response to S. pyogenes polypeptide antigens.

Groups of 8 female BALB/c mice (Charles River, St-Constant,
Quebec, Canada) were immunized by intramuscular injection of 100
µl three times at two- or three-week intervals with 50 µg of
recombinant pCMV-GH encoding modified SHB-GAS-102, SHB-GAS-103
(SEQ ID NO: 3), and SHB-GAS-104 (SEQ ID NO: 5) genes in the presence
of 50 µg of granulocyte-macrophage colony-stimulating factor
(GM-CSF)-expressing plasmid pCMV-GH-GM-CSF (Laboratory of Dr.
Stephen A. Johnston, Department of Biochemistry, The University
of Texas, Dallas, Texas). As control, groups of mice were
injected with 50 µg of pCMV-GH in the presence of 50 µg of
pCMV-GH-GM-CSF. Blood samples were collected from the orbital sinus
prior to each immunization and seven days following the third
injection, and serum antibody responses were determined by ELISA
using the corresponding His-tagged S. pyogenes
recombinant polypeptides as coating antigens. The production and
purification of these His-tagged S. pyogenes recombinant
polypeptides are presented in Example 6.

EXAMPLE 6

This example illustrates the production and purification of S.
pyogenes recombinant polypeptides.

The recombinant pET-19b(+) plasmids with SHB-GAS-102 (SEQ ID NO:
1), SHB-GAS-103 (SEQ ID NO: 3), and SHB-GAS-104 (SEQ ID NO: 5)
genes were used to transform by electroporation (Gene Pulser II
apparatus, BIO-RAD Labs, Mississauga, Canada) E. coli strains
BL21(DE3) (F- ompT hsdSB (rB- mB-) gal dcm (DE3)) or BL21 star
(DE3) (F- ompT hsdSB (rB- mB-) gal dcm rne131 (DE3)) (Novagen,
Madison, WI). In these strains of E. coli, the T7 promoter
controlling expression of the recombinant polypeptide is
specifically recognized by the T7 RNA polymerase (present on the
λDE3 prophage) whose gene is under the control of the lac
promoter which is inducible by isopropyl-β-D-thiogalactopyranoside
(IPTG). The transformants BL21(DE3)/rpET or
BL21 star(DE3)/rpET were grown at 37°C with agitation at 250
rpm in LB broth (peptone 10 g/L, yeast extract 5 g/L, NaCl 10 g/L)
containing 100 µg of carbenicillin (Sigma-Aldrich Canada Ltd.,
Oakville, Canada) per ml until the A600 reached a value of 0.6.
In order to induce the production of His-tagged S. pyogenes
recombinant polypeptides, the cells were incubated for 2
additional hours in the presence of IPTG at a final
concentration of 1 mM. Induced cells from a 500 ml culture
were pelleted by centrifugation and frozen at -70°C.

The purification of the recombinant polypeptides from the
soluble fraction of IPTG-induced BL21(DE3)/rpET19b(+) or BL21
star(DE3)/rpET19b(+) was done by affinity chromatography based
on the properties of the His•Tag sequence (10 consecutive
histidine residues) to bind to divalent cations (Ni2+)
immobilized on the His•Bind metal chelation resin. Briefly, the
pelleted cells obtained from a 400 mL culture induced with IPTG
were resuspended in lysis buffer (20 mM Tris, 500 mM NaCl, 10 mM
imidazole, pH 7.9), sonicated and centrifuged at 12,000 x g for
20 min to remove debris. The supernatant was incubated with
Ni-NTA agarose resin (QIAgen) for 45 min at 4°C. The S. pyogenes
recombinant polypeptides were eluted from the resin with a
solution of 250 mM imidazole-500 mM NaCl-20 mM Tris, pH 7.9. The
removal of the salt and imidazole from the samples was done by
dialysis against PBS buffer overnight at 4°C. The amount of
recombinant polypeptide was estimated by MicroBCA (Pierce,
Rockford, Ill.).

EXAMPLE 7

This example illustrates the accessibility to antibodies of the
S. pyogenes recombinant polypeptides at the surface of intact
streptococcal cells.

Bacteria were grown in Todd Hewitt (TH) broth (Difco
Laboratories, Detroit, MI) with 0.5% yeast extract (Difco
Laboratories) and 1% peptone extract (Merck, Darmstadt, Germany)
at 37°C in an 8% CO2 atmosphere to give an OD600nm of 0.600 (~10^9
CFU/ml). Dilutions of anti-SHB-GAS-102, anti-SHB-GAS-103,
anti-SHB-GAS-104, or control sera were then added and allowed to bind
to the cells, which were incubated for 2 h at 4°C. Samples were
washed 2 times in blocking buffer [phosphate-buffered saline
(PBS) containing 2% bovine serum albumin (BSA)], and then 0.5 ml
of goat fluorescein (FITC)-conjugated anti-mouse IgG + IgM
diluted in blocking buffer was added. After an additional
incubation of 60 min at room temperature, samples were washed 2
times in blocking buffer and fixed with 0.3% formaldehyde in
PBS buffer for 18-24 h at 4°C. Cells were kept in the dark at
4°C until analyzed by flow cytometry (Epics® XL; Beckman
Coulter, Inc.). Ten thousand intact S. pyogenes cells were
analyzed per sample.

EXAMPLE 8

This example illustrates the protection against fatal S.
pyogenes infection induced by passive immunization of mice with
rabbit hyper-immune sera.

A New Zealand rabbit (Charles River) was immunized subcutaneously
three times at 3-week intervals at multiple sites with 100 µg of
His-tagged recombinant SHB-GAS-102 protein in the presence of
Freund's incomplete adjuvant (Gibco-BRL). Blood samples were
collected three weeks after the third injection. The antibodies
present in the serum were partially purified by precipitation
using 40% saturated ammonium sulfate solution. Groups of 6
female BALB/c mice (Charles River) were injected intravenously
with 500 µl of partially purified serum collected from a rabbit
immunized with the recombinant SHB-GAS-102 protein, or a rabbit
immunized with an unrelated control recombinant protein.
Eighteen hours later the mice were challenged with approximately
2x10^7 CFU of the type 3 S. pyogenes strain ATCC12384. Samples
of the S. pyogenes challenge inoculum were plated on blood agar
plates to determine the CFU and to verify the challenge dose.
Deaths were recorded for a period of 8 days.

As presented in Table 4, 67% (4/6) of the mice immunized with partially
purified serum collected from a rabbit immunized with the
recombinant SHB-GAS-102 protein were protected against a lethal
challenge. In contrast, immunization of mice with serum
collected from a rabbit immunized with an irrelevant protein did
not confer such protection (Table 4).



Table 4. Ability of SHB-GAS-102-specific antibodies to elicit
passive protection against S. pyogenes strain ATCC12384 (M3)

Group                                    No. mice surviving    % survival
SHB-GAS-102-specific antibodies          4/6                   67
Unrelated protein-specific antibodies    0/6                   0

EXAMPLE 9

This example illustrates the protection of mice against fatal S.
pyogenes infection induced by immunization with recombinant
polypeptides.

Groups of female BALB/c mice (Charles River) were immunized
subcutaneously three times at two-week intervals with 20 µg of
affinity purified SHB-GAS-102, SHB-GAS-103, or SHB-GAS-104
His-tagged recombinant polypeptides in the presence of 10 µg of QuilA
adjuvant (Cedarlane Laboratories Ltd, Hornby, Canada) or, as
control, with QuilA adjuvant alone in PBS or with 20 µg of
irrelevant polypeptide in the presence of QuilA. Blood samples were
collected from the orbital sinus on days 1, 14 and 28 prior to
each immunization and 14 days (day 42) following the third
injection. One week later the mice were challenged with
approximately 4-8x10^6 CFU of the type 3 S. pyogenes strain
ATCC12384. Samples of the S. pyogenes challenge inoculum were
plated on blood agar plates to determine the CFU and to verify
the challenge dose. Deaths were recorded for a period of 7
days.

As presented in Table 5, 83% (10/12) of the mice immunized
with SHB-GAS-102 polypeptide were protected against a lethal
challenge. Mice immunized with SHB-GAS-103 or SHB-GAS-104 were
also protected against a lethal challenge (7/12 in each group). In
contrast, immunization of mice with adjuvant only or with an
irrelevant protein did not confer such protection (Table 5).
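The survival percentages quoted for Examples 8 and 9 follow directly from the survivor counts. As a small illustrative sketch (the helper name `percent_survival` and the dictionary labels are assumptions introduced here, and rounding to the nearest whole percent is assumed from the reported values), the arithmetic can be reproduced as follows:

```python
def percent_survival(survivors, total):
    """Return the survival percentage rounded to the nearest whole number."""
    if total <= 0:
        raise ValueError("total must be a positive number of animals")
    return round(100 * survivors / total)


# Survivor counts (survivors, group size) as reported in Tables 4 and 5.
groups = {
    "Table 4, SHB-GAS-102-specific antibodies": (4, 6),
    "Table 4, unrelated protein-specific antibodies": (0, 6),
    "Table 5, SHB-GAS-102 + QuilA": (10, 12),
    "Table 5, SHB-GAS-103 + QuilA": (7, 12),
    "Table 5, SHB-GAS-104 + QuilA": (7, 12),
    "Table 5, irrelevant polypeptide + QuilA": (2, 12),
    "Table 5, QuilA in PBS": (3, 12),
}

if __name__ == "__main__":
    for name, (survivors, total) in groups.items():
        print(f"{name}: {survivors}/{total} = {percent_survival(survivors, total)}%")
```

The sketch only recomputes percentages; it makes no statement about statistical significance, which the text does not report.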

Table 5. Ability of recombinant SHB-GAS-102, SHB-GAS-103, and
SHB-GAS-104 polypeptides to elicit protection against S.
pyogenes strain ATCC12384 (M3)

Groups                                               No. mice surviving    % survival
20 µg of SHB-GAS-102 polypeptide + 10 µg of QuilA    10/12                 83
20 µg of SHB-GAS-103 polypeptide + 10 µg of QuilA    7/12                  58
20 µg of SHB-GAS-104 polypeptide + 10 µg of QuilA    7/12                  58
20 µg of irrelevant polypeptide + 10 µg of QuilA     2/12                  17
10 µg of QuilA in PBS                                3/12                  25



Figure 1

1 atgtcacgta ttggtaataa agtaattact atgcctgcag gcgttgaatt aacaaataac 

61 aacaatgtta ttactgttaa aggccctaaa ggcgaactca ctcgtgagtt caacaaaaat 

121 attgaaatca aagttgaagg gactgaaatc acagttgtac gtcctaacga ctcaaaagaa 

181 atgaaaacaa tccatggtac aacccgtgct aacttgaata acatggttgt aggtgtttct 

241 gaaggtttca aaaaagatct tgaaatgaag ggtgtcggtt accgtgctca acttcaaggt

301 actaaacttg tcctttcagt aggtaaatct caccaagacg aagttgaagc tccagaagga 

361 attactttca ctgttgctaa cccaacttca atctcagttg aaggaatcaa caaagaagtt 

421 gttggtcaaa cagctgctta catccgtagc ttgcgttcac cagagcctta caaaggcaaa 

481 gggatccgtt acgttggtga atacgtacgc cttaaagaag gtaaaacagg taaataa 

(SEQ ID NO: 1) 



Figure 2 

1 MSRIGNKVIT MPAGVELTNN NNVITVKGPK GELTREFNKN IEIKVEGTEI TVVRPNDSKE
61 MKTIHGTTRA NLNNMVVGVS EGFKKDLEMK GVGYRAQLQG TKLVLSVGKS HQDEVEAPEG
121 ITFTVANPTS ISVEGINKEV VGQTAAYIRS LRSPEPYKGK GIRYVGEYVR LKEGKTGK 
(SEQ ID NO: 2) 
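The amino acid sequence of Figure 2 corresponds to the translation of the coding sequence of Figure 1. The following minimal Python sketch shows how that correspondence can be checked with the standard genetic code. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration, and use of the standard code is an assumption.

```python
# Standard genetic code, written in the conventional T, C, A, G ordering
# of first, second and third codon positions ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon.

    Whitespace between ten-base blocks is ignored; input may be lower case.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First row of SEQ ID NO: 1 (Figure 1).
    row1 = "atgtcacgta ttggtaataa agtaattact atgcctgcag gcgttgaatt aacaaataac"
    print(translate(row1))
```

Applied to the first row of Figure 1, the sketch yields the first twenty residues shown in Figure 2. The same check can be used to resolve ambiguous characters in the printed protein sequences.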



Figure 3 

1 atgtttcagt taagaaaaaa aatgacgcgc aaacaattag ccttgttgag tgctggagtg
61 ttgacctgtg tggttggtgg t agctacttg ataatgaacc atcaacaaca agaaattgtc
121 tctagtgtca acaaagtaaa agccttaacc ataaaagaag ccatggaaca aggaaaagat
181 atcagcttga ccttagctgg cgaagtaaca gctaacaaca gcagcaaagt caaaatcgac 
241 tcaagtaaag gagaagtcaa agaggtcttt gttaaaaaag gcgatgttgt caaagtagga 
301 caacccttgt ttagctatga aacgtcacag cggttaacgg ctcaaagttc agaatttgat 
361 gttcaaacca aagccaatca gctccaagtt gctaaaacca atgcagcatt gaagtgggaa 
421 acctacaatc gcaaggtcaa tgaaatcaac accctaaaat ctcgctacaa cactgcacca
481 gatgagagct tactagagca aattcgcagc gcagaagaca gtgtatccca agcactaagc
541 gatgccaaaa cagcagatag cgatgtcaaa accgctcaaa tcgaactcga taaagctaat 
601 gctactgcca caacggaaaa aggtaaacta gagtatgaca ccgttaagtc agacaccgca 
661 ggaaccattg ttagtctaaa tactgatttg ccaaatcaat caaaatccaa aaaagaaaat 
721 gaaactttta tggaaattat cgacaaatca aaaatgttag tcaaaggtaa cattagtgaa 
781 tttgaccgtg acaagttaaa aatcggtcaa aaagtcgaag tgattgaccg caaagacaac 
841 tctaaaaaat ggactggaaa agtaacccaa gttggcaacc tcaaagcaga ggaaaaaggc 
901 caaggtcaag gccaaggtgg caatgaccaa caagataatc caaaccaagc aaaattccct 
961 tatgttattg aacttgacca atcagacaag cagccactca ttggctcaca cacctatgtt 
1021 aatgtgctca acaatgttcc agaagctggc aagatcgtat tgaaagaaac ctttacaatg 
1081 gcagaaaatg gaaaaaccta tgtgtggaaa gttgataaaa acaaggtcaa aaaacaagaa 
1141 atcaagacta agcccttctc aaaaggttat gttgaggtaa caagtggctt gactatgcaa
1201 gataagattg ctcagccgct tcctggcatg aaagacggta tggaggtagg aagtattgtt 
1261 aaaccttaa
(SEQ ID NO: 3) 



Figure 4 

1 MFQLRKKMTR KQLALLSAGV LTCVVGG SYL IMNHQQQEIV SSVNKVKALT IKEAMEQGKD

61 ISLTLAGEVT ANNSSKVKID SSKGEVKEVF VKKGDVVKVG QPLFSYETSQ RLTAQSSEFD

121 VQTKANQLQV AKTNAALKWE TYNRKVNEIN TLKSRYNTAP DESLLEQIRS AEDSVSQALS 

181 DAKTADSDVK TAQIELDKAN ATATTEKGKL EYDTVKSDTA GTIVSLNTDL PNQSKSKKEN

241 ETFMEIIDKS KMLVKGNISE FDRDKLKIGQ KVEVIDRKDN SKKWTGKVTQ VGNLKAEEKG 

301 QGQGQGGNDQ QDNPNQAKFP YVIELDQSDK QPLIGSHTYV NVLNNVPEAG KIVLKETFTM 

361 AENGKTYVWK VDKNKVKKQE IKTKPFSKGY VEVTSGLTMQ DKIAQPLPGM KDGMEVGSIV 

421 KP 

(SEQ ID NO: 4)



Figure 5

[Only the first ten-base block of rows 61 to 841 is legible in this copy.]
61 gtgaatcagc
121 gttgctgtgg
181 aaattatata
241 cctgatatag
301 caagaagatt
361 cgaagtgttg
421 caagaagcaa
481 cgtaagggga
541 gtggcgacga
601 tatcagtcag
661 cctgacttga
721 aaagaatttg
781 tatgatttgg
841 gacaccttaa
(SEQ ID NO: 5)



Figure 6 

1 MIKRCKGIGL ALMAFFLVA C VNQHPKTAKE TEQQRIVATS VAVVDICDRL NLDLVGVCDS

61 KLYTLPKRYD AVKRVGLPMN PDIELIASLK PTWILSPNSL QEDLEPKYQK LDTEYGFLNL

121 RSVEGMYQSI DDLGNLFQRQ QEAKELRQQY QDYYRAFQAK RKGKKKPKVL ILMGLPGSYL

181 VATNQSYVGN LLDLAGGENV YQSDEKEFLS ANPEDMLAKE PDLILRTAHA IPDKVKVMFD

241 KEFAENDIWK HFTAVKEGKV YDLDNTLFGM SAKLNYPEAL DTLTQLFDHV GDHP

(SEQ ID NO: 6) 



Figure 7 

1 atgtcacgta ttggtaataa agtaattact atgcctgcag gcgttgaatt aacaaataac 

61 aacaatgtta ttactgttaa aggccctaaa ggcgaactca ctcgtgagtt taacaaaaat 

121 attgaaatca aagttgaagg gactgaaatc acagttgtac gtcctaacga ctcaaaagaa 

181 atgaaaacaa tccatggtac aacccgtgct aacttgaata acatggttgt aggtgtttct 

241 gaaggtttca aaaaagatct tgaaatgaag ggtgtcggtt accgtgctca acttcaaggt

301 actaaacttg tcctttcagt aggtaaatct caccaagacg aagttgaagc tccagaagga

361 attactttca ctgttgctaa cccaacttca atctcagttg aaggaatcaa caaagaagtt 

421 gttggtcaaa cagctgctta catccgtagc ttgcgttcac cagagcctta caaaggcaaa

481 gggatccgtt acgttggtga atacgtacgc cttaaagaag gtaaaacagg taaataa 

(SEQ ID NO: 21) 
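The variant sequences in Figures 7 onward differ from SEQ ID NO: 1 at only a few nucleotide positions. A minimal Python sketch for listing such differences between two equal-length rows is given below. The function name `differing_positions` and the `offset` parameter are hypothetical, and the two rows are copied from row 61 of Figure 1 and Figure 7.

```python
def differing_positions(seq_a, seq_b, offset=1):
    """Return the positions at which two equal-length sequences differ.

    Positions are numbered from `offset`, so passing the row label printed
    in the figure (for example 61) gives coordinates within the gene.
    Whitespace between ten-base blocks is ignored.
    """
    a = "".join(seq_a.split()).lower()
    b = "".join(seq_b.split()).lower()
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [i + offset for i, (x, y) in enumerate(zip(a, b)) if x != y]


if __name__ == "__main__":
    row61_seq1 = "aacaatgtta ttactgttaa aggccctaaa ggcgaactca ctcgtgagtt caacaaaaat"
    row61_seq21 = "aacaatgtta ttactgttaa aggccctaaa ggcgaactca ctcgtgagtt taacaaaaat"
    print(differing_positions(row61_seq1, row61_seq21, offset=61))
```

For these two rows the only difference is at position 111 (c in SEQ ID NO: 1, t in SEQ ID NO: 21). Both codons, ttc and ttt, encode phenylalanine, so the change does not alter the polypeptide at that residue.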



Figure 8 

1 MSRIGNKVIT MPAGVELTNN NNVITVKGPK GELTREFNKN IEIKVEGTEI TVVRPNDSKE
61 MKTIHGTTRA NLNNMVVGVS EGFKKDLEMK GVGYRAQLQG TKLVLSVGKS HQDEVEAPEG
121 ITFTVANPTS ISVEGINKEV VGQTAAYIRS LRSPEPYKGK GIRYVGEYVR LKEGKTGK
(SEQ ID NO: 22)



Figure 9

1 atgtcacgta ttggtaataa agtaattact atgcctgcag gcgttgaatt aacaaataac 

61 aacaatgtta ttactgttaa aggccctaaa ggcgaactca ctcgtgagtt taacaaaaat

121 attgaaatca aagttgaagg gactgaaatc acagttgtac gtcctaacga ctcaaaagaa 

181 atgaaaacaa tccatggtac aacccgtgct aacttgaata acatggttgt aggtgtttct 

241 gaaggtttca aaaaagatct tgaaatgaag ggtgtcggtt accgtgctca acttcaaggt 

301 actaaacttg tcctttcagt aggtaaatct caccaagacg aagttgaagc tccagaagga 

361 attactttca ctgttgctaa cccaacttca atctcagttg aaggaatcaa caaagaagtt 

421 gttggtcaaa cagctgctta catccgtagc ttgcgttcac cagagcctta caaaggcaaa

481 gggatccgtt acgttggtga atacgtacgc cttaaagaag gtaaaacagg taaataa 

(SEQ ID NO: 23) 



Figure 10 

1 MSRIGNKVIT MPAGVELTNN NNVITVKGPK GELTREFNKN IEIKVEGTEI TVVRPNDSKE
61 MKTIHGTTRA NLNNMVVGVS EGFKKDLEMK GVGYRAQLQG TKLVLSVGKS HQDEVEAPEG
121 ITFTVANPTS ISVEGINKEV VGQTAAYIRS LRSPEPYKGK GIRYVGEYVR LKEGKTGK
(SEQ ID NO: 24) 



Figure 11 

1 atgtcacgta ttggtaataa agtaattact atgcctgcag gcgttgaatt aacaaataac 

61 aacaatgtta ttactgttaa aggccctaaa ggcgaactca ctcgtgagtt caacaaaaat 

121 attgaaatca aagttgaagg gactgaaatc acagttgtac gtcctaacga ctcaaaagaa 

181 atgaaaacaa tccatgggac aacccgtgct aacttgaata acatggttgt aggtgtttct 

241 gaaggtttca aaaaagatct tgaaatgaag ggtgtcggtt accgcgctca acttcaaggt 

301 actaaacttg tcctttcagt aggtaaatct caccaagacg aagttgaagc tccagaagga 

361 attactttca ctgttgctaa cccaacttca atctcagttg aaggaatcaa caaagaagtt 

421 gttggtcaaa cagctgctta catccgtagc ttgcgttcac cagagcctta caaaggcaaa

481 gggatccgtt acgttggtga atacgtacgc cttaaagaag gtaaaacagg taaataa
(SEQ ID NO: 25) 



Figure 12 

1 MSRIGNKVIT MPAGVELTNN NNVITVKGPK GELTREFNKN IEIKVEGTEI TVVRPNDSKE
61 MKTIHGTTRA NLNNMVVGVS EGFKKDLEMK GVGYRAQLQG TKLVLSVGKS HQDEVEAPEG 
121 ITFTVANPTS ISVEGINKEV VGQTAAYIRS LRSPEPYKGK GIRYVGEYVR LKEGKTGK 
(SEQ ID NO: 26) 



Figure 13 

1 atgtcacgta ttggtaataa agtaattact atgcctgcag gtgttgaatt aacaaataac 

61 aacaatgtta ttactgttaa aggccctaaa ggcgaactca ctcgtgagtt caacaaaaat 

121 attgaaatca aagttgaagg gactgaaatc acagttgtac gtcctaacga ctcaaaagaa 

181 atgaaaacaa tccatggtac aacccgtgct aacttgaata acatggttgt aggtgtttct 

241 gaaggtttca aaaaagatct tgaaatgaag ggtgtcggtt accgtgctca acttcaaggt 

301 actaaacttg tcctttcagt aggtaaatct caccaagacg aagttgaagc tccagaagga 

361 attactttca ctgttgctaa cccaacttca atctcagttg aaggaatcaa caaagaagtt 

421 gttggtcaaa cagctgctta catccgtagc ttgcgttcac cagagcctta caaaggcaaa

481 gggatccgtt acgttggtga atacgtacgc cttaaagaag gtaaaacagg taaataa 
(SEQ ID NO: 27)



Figure 14

1 MSRIGNKVIT MPAGVELTNN NNVITVKGPK GELTREFNKN IEIKVEGTEI TVVRPNDSKE 
61 MKTIHGTTRA NLNNMVVGVS EGFKKDLEMK GVGYRAQLQG TKLVLSVGKS HQDEVEAPEG
121 ITFTVANPTS ISVEGINKEV VGQTAAYIRS LRSPEPYKGK GIRYVGEYVR LKEGKTGK
(SEQ ID NO: 28) 



Figure 15 

1 atgtttcagt taagaaaaaa aatgacgcgc aaacaattag ccttgttgag tgctggagtg
61 ttgacctgtg tggttggtgg t agctacttg ataatgaacc atcaacaaca agaaattgtc
121 tctagtgtca acaaagtaaa agccttaacc ataaaagaag ccatggaaca aggaaaagat 
181 atcagcttga ccttagctgg cgaagtaaca gctaacaaca gcagcaaagt caaaatcgac 
241 tcaagtaaag gagaagtcaa agaggtcttt gttaaaaaag gcgatgttgt caaagtagga 
301 caacccttgt ttagctatga aacgtcacag cggttaacgg ctcaaagttc agaatttgat
361 gttcaaacca aagccaatca gctccaagtt gctaaaacca atgcagcatt gaagtgggaa 
421 acctacaatc gcaaggtcaa tgaaatcaac accctaaaat ctcgctacaa cactgcacca 
481 gatgagagct tactagagca aattcgcagc gcagaagaca gtgtatccca agcactaagc 
541 gatgccaaaa cagcagatag cgatgtcaaa accgctcaaa tcgaactcga taaagctaat 
601 gctactgcca caacggaaaa aggtaaacta gagtatgaca ccgttaagtc agacaccgca
661 ggaaccattg ttagtctaaa tactgatttg ccaaatcaat caaaatccaa aaaagaaaat 
721 gaaactttta tggaaattat cgacaaatca aaaatgttag tcaaaggtaa cattagtgaa 
781 tttgaccgtg acaagttaaa aatcggtcaa aaagtcgaag tgattgaccg caaagacaac 
841 tctaaaaaat ggactggaaa agtaacccaa gttggcaacc tcaaagcaga ggaaaaaggc 
901 caaggtcaag gccaaggtgg caatgaccaa caagataatc caaaccaagc aaaattccct 
961 tatgttattg aacttgacca atcagacaag cagccactca ttggctcaca cacctatgtt
1021 aatgtactca acaatgttcc agaagctggc aagatcgtat tgaaagaaac ctttacaatg 
1081 gcagaaaatg gaaaaaccta tgtgtggaaa gttgataaaa acaaggtcaa aaaacaagaa 
1141 atcaagacta agcccttctc aaaaggttat gttgaggtaa caagtggctt gactatgcaa
1201 gataagattg ctcagccgct tcctggcatg aaagacggta tggaggtagg aagtattgtt
1261 aaaccttaa 
(SEQ ID NO: 29) 

Figure 16 

1 MFQLRKKMTR KQLALLSAGV LTCVVGG SYL IMNHQQQEIV SSVNKVKALT IKEAMEQGKD 

61 ISLTLAGEVT ANNSSKVKID SSKGEVKEVF VKKGDVVKVG QPLFSYETSQ RLTAQSSEFD

121 VQTKANQLQV AKTNAALKWE TYNRKVNEIN TLKSRYNTAP DESLLEQIRS AEDSVSQALS 

181 DAKTADSDVK TAQIELDKAN ATATTEKGKL EYDTVKSDTA GTIVSLNTDL PNQSKSKKEN 

241 ETFMEIIDKS KMLVKGNISE FDRDKLKIGQ KVEVIDRKDN SKKWTGKVTQ VGNLKAEEKG 

301 QGQGQGGNDQ QDNPNQAKFP YVIELDQSDK QPLIGSHTYV NVLNNVPEAG KIVLKETFTM 

361 AENGKTYVWK VDKNKVKKQE IKTKPFSKGY VEVTSGLTMQ DKIAQPLPGM KDGMEVGSIV 
421 KP 
(SEQ ID NO: 30) 



Figure 17

1 atgtttcagt taagaaaaaa aatgacgcgc aaacaattag ccttgttgag tgctggagtg
61 ttgacctgtg tggttggtgg t agctacttg ataatgaacc atcaacaaca agaaattgtc
121 tctagtgtca acaaagtaaa agccttaacc ataaaagaag ccatggaaca aggaaaagat
181 atcagcttga ccttagctgg cgaagtaaca gctaacaaca gcagcaaagt caaaatcgac
241 tcaagtaaag gagaagtcaa agaggtcttt gttaaaaaag gcgatgttgt caaagtagga
301 caacccttgt ttagctatga aacgtcacag cggttaacgg ctcaaagttc agaatttgat
361 gttcaaacca aagccaatca gctccaagtt gctaaaacca atgcagcatt gaagtgggaa
421 acctacaatc gcaaggtcaa tgaaatcaac accctaaaat ctcgctacaa cactgcacca
481 gatgagagct tactagagca aattcgcagc gcagaagaca gtgtatccca agcactaagc
541 gatgccaaaa cagcagatag cgatgtcaaa accgctcaaa tcgaactcga taaagctaat
601 gctactgcca caacggaaaa aggtaaacta gagtatgaca ccgttaagtc agacaccgca
661 ggaaccattg ttagtctaaa tactgatttg ccaaatcaat caaaatccaa aaaagaaaat 

721 gaaactttta tggaaattat cgacaaatca aaaatgttag tcaaaggtaa cattagtgaa 

781 tttgaccgtg acaagttaaa aatcggtcaa aaagtcgaag tgattgaccg caaagacaac 

841 tctaaaaaat ggactggaaa agtaacccaa gttggcaacc tcaaagcaga ggaaaaaggc 

901 caaggtcaag gccaaggtgg caatgaccaa caagataatc caaaccaagc aaaattccct 

961 tatgttattg aacttgacca atcagacaag cagccactca ttggctcaca cacctatgtt 

1021 aatgtactca acaatgttcc agaagctggc aagatcgtat tgaaagaaac ctttacaatg 

1081 gcagaaaatg gaaaaaccta tgtgtggaaa gttgataaaa acaaggtcaa aaaacaagaa 

1141 atcaagacta agcccttctc aaaaggttat gttgaggtaa caagtggctt gactatgcaa 

1201 gataagattg ctcagccgct tcctggcatg aaagacggta tggaggtagg aagtattgtt 
1261 aaaccttaa 
(SEQ ID NO: 31) 



Figure 18 

1 MFQLRKKMTR KQLALLSAGV LTCVVGG SYL IMNHQQQEIV SSVNKVKALT IKEAMEQGKD 

61 ISLTLAGEVT ANNSSKVKID SSKGEVKEVF VKKGDVVKVG QPLFSYETSQ RLTAQSSEFD 

121 VQTKANQLQV AKTNAALKWE TYNRKVNEIN TLKSRYNTAP DESLLEQIRS AEDSVSQALS

181 DAKTADSDVK TAQIELDKAN ATATTEKGKL EYDTVKSDTA GTIVSLNTDL PNQSKSKKEN 

241 ETFMEIIDKS KMLVKGNISE FDRDKLKIGQ KVEVIDRKDN SKKWTGKVTQ VGNLKAEEKG

301 QGQGQGGNDQ QDNPNQAKFP YVIELDQSDK QPLIGSHTYV NVLNNVPEAG KIVLKETFTM 

361 AENGKTYVWK VDKNKVKKQE IKTKPFSKGY VEVTSGLTMQ DKIAQPLPGM KDGMEVGSIV
421 KP 
(SEQ ID NO: 32) 



Figure 19 

1 atgtttcagt taagaaaaaa aatgacgcgc aaacaattag ccttgttgag tgctggagtg

61 ttgacctgtg tggttggtgg t agctacttg ataatgaacc atcaacaaca agaagttgtc

121 tctagtgtca acaaagtaaa agccttaacc ataaaagaag ccatggaaca aggaaaagat 

181 atcagcttga ccttagctgg cgaagtaaca gctaacaaca gcagcaaagt caaaatcgac

241 tcaagtaaag gagaagtcaa agaggtcttt gtcaaaaaag gcgatgttgt caaagtagga 

301 caacccttgt ttagctatga aacgtcacag cggttaacgg ctcaaagttc agaatttgat 

361 gttcaaacca aagccaatca actccaagtt gctaaaacca atgcagcatt gaagtgggaa 

421 acctacaatc gcaaggtcaa tgaaatcaac accctaaaat ctcgctacaa cactgcacca 

481 gatgagagct tactagagca aattcgcagc gcagaagaca gtgtatccca agcactaagc 

541 gatgccaaaa cagcagatag cgatgtcaaa accgctcaaa tcgaactcga taaagctaat 

601 gctactgcca caatggaaaa aggtaaacta gagtatgaca ccgttaagtc agacaccgca

661 ggaaccattg ttagcctaaa tactgatttg ccaaatcaat caaaatccaa aaaagaaaat 

721 gaaactttta tggaaattat cgacaaatca aaaatgttag tcaaaggtaa catcagtgaa 

781 tttgaccgtg acaagttaaa aatcgatcaa aaagtcgaag tgattgaccg caaagacaac 

841 tctaaaaaat ggactggaaa agtaacccaa gttggcaacc tcaaagcaga ggaaaaaggc 

901 caaggtcaag gccaaggtgg caatgaccaa caagacaatc caaaccaagc aaaattccct 

961 tatgttatcg aacttgacca atcagacaag cagccactca ttggctcaca cacctatgtt 
1021 aatgtgctca acaatgttcc agaagctggc aagatcgtat tgaaagaaac ctttacaatg 

1081 gcagaaaatg gaaaaaccta tgtgtggaaa gttgataaaa acaaggtcaa aaaacaagaa 
1141 atcaagacta agcccttctc aaaaggttat gttgaggtga caagtggctt gactatgcaa 

1201 gataagattg ctcagccgct tcctggcatg aaagacggta tggaggtagg aagtattgtt 
1261 aaaccttaa 
(SEQ ID NO: 33)



Figure 20

1 MFQLRKKMTR KQLALLSAGV LTCVVGG SYL IMNHQQQEVV SSVNKVKALT IKEAMEQGKD
61 ISLTLAGEVT ANNSSKVKID SSKGEVKEVF VKKGDVVKVG QPLFSYETSQ RLTAQSSEFD
121 VQTKANQLQV AKTNAALKWE TYNRKVNEIN TLKSRYNTAP DESLLEQIRS AEDSVSQALS 
181 DAKTADSDVK TAQIELDKAN ATATMEKGKL EYDTVKSDTA GTIVSLNTDL PNQSKSKKEN 
241 ETFMEIIDKS KMLVKGNISE FDRDKLKIDQ KVEVIDRKDN SKKWTGKVTQ VGNLKAEEKG
301 QGQGQGGNDQ QDNPNQAKFP YVIELDQSDK QPLIGSHTYV NVLNNVPEAG KIVLKETFTM 
361 AENGKTYVWK VDKNKVKKQE IKTKPFSKGY VEVTSGLTMQ DKIAQPLPGM KDGMEVGSIV
421 KP 
(SEQ ID NO: 34) 



Figure 21 

1 atgtttcagt taagaaaaaa aatgacgcgc aaacaattag ccttgttgag tgctggagtg
61 ttgacctgtg tggttggtgg t acctacttg ataatgaatc atcaacaaca agaaattgtc
121 tctagtgtca acaaagtaaa agccttaacc ataaaagaag ccatggaaca aggaaaagat 
181 atcagcttga ccttagctgg cgaagtaaca gctaacaaca gcagcaaagt caaaatcgac 
241 tcaagtaaag gagaagtcaa agatgtcttt gtcaaaaaag gcgatgttgt caaagtagga
301 caacccttgt ttagctatga aacgtcacaa cggttaacgg ctcaaagttc agaatttgat
361 gttcaaacca aagccaatca actccaagtt gctaaaacca atgcagcatt gaagtgggaa
421 acctacaatc gcaaggtcaa tgaaatcaat accctaaaat ctcgctacaa cactgcacca 
481 gatgagagct tactagagca aattcgcagc gcagaagaca gtgtatctca agcactaagc 
541 gatgccaaaa cagcagatag cgatgtcaaa accgctcaaa tcgaactcga taaagctaat
601 gctactgccg caacggaaaa aggtaaacta gagtatgaca ccgttaagtc agacaccgca
661 ggaaccattg ttagtctaaa tactgatttg ccaaatcaat caaaatccaa aaaagaaaat
721 gaaactttta tggaaattat cgacaaatca aaaatgttag tcaaaggtaa catxragtgaa 
781 tttgaccgtg acaagttaaa aatcgatcaa aaagtcgaag tgattgaccg caaagacaac 
841 tctaaaaaat ggactggaaa agtaacccaa gttggcaacc tcaaagcaga ggaaaaaggc
901 caaggtcaag gccaaggtgg caatgaccaa caagataatc caaaccaagc aaaattccct
961 tatgttatcg aacttgacca atcagacaag cagccactca ttggttcaca cacctatgtt 
1021 aatgtgctca acaatgttcc agaagctggc aagatcgtat tgaaagaaac ctttacaatg 
1081 gcagaaaatg gaaaaaccta tgtgtggaaa gttgataaaa acaaggtcaa aaaacaagaa 
1141 atcaagacta agcccttctc aaaaggttat gttgaggtga caagcggctt gactatgcaa
1201 gataagattg ctcagccgct tcctggcatg aaagacggta tggaggtagg aagtattgtt 
1261 aaaccttaa 
(SEQ ID NO: 35) 



Figure 22 

1 MFQLRKKMTR KQLALLSAGV LTCVVGG TYL IMNHQQQEIV SSVNKVKALT IKEAMEQGKD 
61 ISLTLAGEVT ANNSSKVKID SSKGEVKDVF VKKGDVVKVG QPLFSYETSQ RLTAQSSEFD 
121 VQTKANQLQV AKTNAALKWE TYNRKVNEIN TLKSRYNTAP DESLLEQIRS AEDSVSQALS 
181 DAKTADSDVK TAQIELDKAN ATAATEKGKL EYDTVKSDTA GTIVSLNTDL PNQSKSKKEN 
241 ETFMEIIDKS KMLVKGNISE FDRDKLKIDQ KVEVIDRKDN SKKWTGKVTQ VGNLKAEEKG 
301 QGQGQGGNDQ QDNPNQAKFP YVIELDQSDK QPLIGSHTYV NVLNNVPEAG KIVLKETFTM 
361 AENGKTYVWK VDKNKVKKQE IKTKPFSKGY VEVTSGLTMQ DKIAQPLPGM KDGMEVGSIV 
421 KP 
(SEQ ID NO: 36)



Figure 23

1 atgataaaac gatgtaaagg aattggtcta gtcttaatgg ccttcttttt ggtagct tgt
[Rows 61 to 421 are illegible in this copy; only the first ten-base block of rows 481 to 841 is legible.]
481 cgcaagggga
541 gtggcgacga
601 tatcagtcag
661 cctgacttga
721 aaagaatttg
781 tatgatttgg
841 gacaccttaa
(SEQ ID NO: 37)



Figure 24 

1 MIKRCKGIGL VLMAFFLVA C VNQHPKTAKE TEQQRIVATS VAVVDICDRL NLDLVGVCDS

61 KLYTLPKRYD AVKRVGLPMN PDIELIASLK PTWILSPNSL QEDLEPKYQK LDTEYGFLNL 

121 RSVEGMYQSI DDLGNLFQRQ QEAKELRQQY QDYYRAFQAK RKGKKKPKVL ILMGLPGSYL 

181 VATNQSYVGN LLDLAGGENV YQSDEKEFLS VNPEDMLAKE PDLILRTAHA IPDKVKVMFD

241 KEFAENDIWK HFTAVKEGKV YDLDNTLFGM SAKLNYPEAL DTLTQLFDHV GDHP 
(SEQ ID NO: 38) 



Figure 25 

1 atgataaaac gatgtaaagg aattggtcta gtcttaatgg ccttcttttt ggtagct tgt
61 gtaaatcagc accctaaaac ggctaaagag actgaacagc agagaattgt agccacttcg 
121 gttgctgtgg ttgatatctg tgaccgttta aatttagacc tcgttggggt ttgtgatagt 
181 aaattatata cccttcctaa acgctatgat gctgttaagc gtgtgggttt acccatgaat
241 cctgatatag agttgattgc ttctttgaaa ccaacttgga ttttgagtcc caattcttta 
301 caagaagatt tggaacccaa gtatcaaaaa ttggatactg agtatggttt tttgaactta 
361 cgaagtgttg agggcatgta ccagtctatt gatgatttag ggaacctttt ccaacgtcaa 
421 caagaagcaa aagaattgcg ccagcaatac caggactatt atcgtgcttt ccaagctaaa 
481 cgcaagggga agaaaaagcc taaagtgctt attcttatgg gcttgccagg tagttatttg 
541 gtggcgacga accaatctta tgtagggaat cttttggact tggcaggtgg tgagaatgtt 
601 tatcagtcag atgagaaaga atttctatca gttaatcctg aagacatgct agctaaggag
661 cctgacttga ttttacgaac agctcacgcc attccagaca aggtaaaagt gatgtttgac
721 aaagaatttg ctgaaaatga tatttggaaa cattttacgg cagtcaagga agggaaagtc
781 tatgatttgg acaataccct gtttggcatg agtgctaaat tgaactaccc agaagccttg 
841 gacaccttaa cacagctttt tgaccacgtg ggagatcatc cgtaa 
(SEQ ID NO: 39)



Figure 26 

1 MIKRCKGIGL VLMAFFLVA C VNQHPKTAKE TEQQRIVATS VAVVDICDRL NLDLVGVCDS 

61 KLYTLPKRYD AVKRVGLPMN PDIELIASLK PTWILSPNSL QEDLEPKYQK LDTEYGFLNL 

121 RSVEGMYQSI DDLGNLFQRQ QEAKELRQQY QDYYRAFQAK RKGKKKPKVL ILMGLPGSYL 

181 VATNQSYVGN LLDLAGGENV YQSDEKEFLS VNPEDMLAKE PDLILRTAHA IPDKVKVMFD

241 KEFAENDIWK HFTAVKEGKV YDLDNTLFGM SAKLNYPEAL DTLTQLFDHV GDHP 
(SEQ ID NO: 40)



Figure 27

1 atgataaaac gatgtaaagg aattggtcta gccttaatgg ccttcttttt ggtagct tgt
61 gtgaatcagc accctaaaac ggctaaagag actgaacagc agagaattgt agccacttcg
121 gttgctgtgg ttgatatctg tgaccgttta aatttagacc tcgttggggt ttgtgatagt
181 aaattatata cccttcctaa acgctatgat gctgttaagc gtgtgggttt acccatgaat
241 cctgatatag agttgattgc ttctttgaaa ccaacttgga ttttgagtcc caattcttta
301 caagaagatt tggaacccaa gtatcaaaaa ttggatactg agtatggttt tttgaactta
361 cgaagtgttg agggcatgta ccagtccatt gatgatttag ggaacctttt ccaacgtcaa
421 caagaagcaa aagaattgcg ccagcaatac caggactatt atcgtgcttt ccaagctaaa
481 cgtaagggga agaaaaagcc taaagtgctt attcttatgg gcttgccagg tagttatttg
541 gtggcgacga accaatctta tgtagggaat cttttggatt tggcaggtgg tgagaatgtt
601 tatcagtcag atgagaaaga atttctatca gctaatcctg aagacatgct ggctaaggag
661 cctgatttga ttttacgaac agctcacgcc attccagaca aggtaaaagt gatgtttgac
721 aaagaatttg ctgaaaatga tatttggaaa cattttacgg cagtcaagga agggaaagtc
781 tatgatttgg acaataccct gtttggcatg agtgctaaat tgaactaccc agaagccttg
841 gacaccttaa cacagctttt tgaccgcgtg ggagatcatc cgtaa
(SEQ ID NO: 41)



Figure 28

1 MIKRCKGIGL ALMAFFLVA C VNQHPKTAKE TEQQRIVATS VAVVDICDRL NLDLVGVCDS
61 KLYTLPKRYD AVKRVGLPMN PDIELIASLK PTWILSPNSL QEDLEPKYQK LDTEYGFLNL
121 RSVEGMYQSI DDLGNLFQRQ QEAKELRQQY QDYYRAFQAK RKGKKKPKVL ILMGLPGSYL
181 VATNQSYVGN LLDLAGGENV YQSDEKEFLS ANPEDMLAKE PDLILRTAHA IPDKVKVMFD
241 KEFAENDIWK HFTAVKEGKV YDLDNTLFGM SAKLNYPEAL DTLTQLFDRV GDHP
(SEQ ID NO: 42)



Figure 29

1 atgataaaac gatgtaaagg aattggtcta gtcttaatgg ccttcttttt ggtagct tgt
61 gtaaatcagc accctaaaac ggctaaagag actgaacagc agagaattgt agccacttcg
121 gttgctgtgg ttgatatctg tgaccgttta aatttagacc tcgttggggt ttgtgatagt
181 aaattatata cccttcctaa acgctatgat gctgttaagc gtgtgggttt acccatgaat
241 cctgatatag agttgattgc ttctttgaaa ccaacttgga ttttgagtcc caattcttta
301 caagaagatt tggaacccaa gtatcaaaaa ttggatactg agtatggttt tttgaactta
361 cgaagtgttg agggcatgta ccagtctatt gatgatttag ggaacctttt ccaacgtcaa
421 caagaagcaa aagaattgcg ccagcaatac caggactatt atcgtgcttt ccaagctaaa
481 cgtaagggga agaaaaagcc taaagtgctt attcttatgg gcttgccagg tagttatttg
541 gtggcgacga accaatctta tgtagggaat cttttggact tggcaggtgg tgagaatgtt
601 tatcagtcag atgagaaaga atttctatca gttaatcctg aagacatgct agctaaggag
661 cctgacttga ttttacgaac agctcacgcc attccagaca aggtaaaagt gatgtttgac
721 aaagaatttg ctgaaaatga tatttggaaa cattttacgg cagtcaagga agggaaagtc
781 tatgatttgg acaataccct gtttggcatg agtgctaaat tgaactaccc agaagccttg
841 gacaccttaa cacagctttt tgaccacgtg ggagatcatc cgtaa
(SEQ ID NO: 43)



Figure 30 

1 MIKRCKGIGL VLMAFFLVA C VNQHPKTAKE TEQQRIVATS VAVVDICDRL NLDLVGVCDS 

61 KLYTLPKRYD AVKRVGLPMN PDIELIASLK PTWILSPNSL QEDLEPKYQK LDTEYGFLNL 

121 RSVEGMYQSI DDLGNLFQRQ QEAKELRQQY QDYYRAFQAK RKGKKKPKVL ILMGLPGSYL 

181 VATNQSYVGN LLDLAGGENV YQSDEKEFLS VNPEDMLAKE PDLILRTAHA IPDKVKVMFD 

241 KEFAENDIWK HFTAVKEGKV YDLDNTLFGM SAKLNYPEAL DTLTQLFDHV GDHP 
(SEQ ID NO: 44)



Figure 31

            1                                                   50
ATCC700294  ATGTCACGTA TTGGTAATAA AGTAATTACT ATGCCTGCAG GCGTTGAATT
MGAS315     ATGTCACGTA TTGGTAATAA AGTAATTACT ATGCCTGCAG GCGTTGAATT
SSI-1       ATGTCACGTA TTGGTAATAA AGTAATTACT ATGCCTGCAG GCGTTGAATT
Manfredo    ATGTCACGTA TTGGTAATAA AGTAATTACT ATGCCTGCAG GCGTTGAATT
MGAS8232    ATGTCACGTA TTGGTAATAA AGTAATTACT ATGCCTGCAG GTGTTGAATT

            51                                                 100
ATCC700294  AACAAATAAC AACAATGTTA TTACTGTTAA AGGCCCTAAA GGCGAACTCA
MGAS315     AACAAATAAC AACAATGTTA TTACTGTTAA AGGCCCTAAA GGCGAACTCA
SSI-1       AACAAATAAC AACAATGTTA TTACTGTTAA AGGCCCTAAA GGCGAACTCA
Manfredo    AACAAATAAC AACAATGTTA TTACTGTTAA AGGCCCTAAA GGCGAACTCA
MGAS8232    AACAAATAAC AACAATGTTA TTACTGTTAA AGGCCCTAAA GGCGAACTCA

            101                                                150
ATCC700294  CTCGTGAGTT CAACAAAAAT ATTGAAATCA AAGTTGAAGG GACTGAAATC
MGAS315     CTCGTGAGTT TAACAAAAAT ATTGAAATCA AAGTTGAAGG GACTGAAATC
SSI-1       CTCGTGAGTT TAACAAAAAT ATTGAAATCA AAGTTGAAGG GACTGAAATC
Manfredo    CTCGTGAGTT CAACAAAAAT ATTGAAATCA AAGTTGAAGG GACTGAAATC
MGAS8232    CTCGTGAGTT CAACAAAAAT ATTGAAATCA AAGTTGAAGG GACTGAAATC

            151                                                200
ATCC700294  ACAGTTGTAC GTCCTAACGA CTCAAAAGAA ATGAAAACAA TCCATGGTAC
MGAS315     ACAGTTGTAC GTCCTAACGA CTCAAAAGAA ATGAAAACAA TCCATGGTAC
SSI-1       ACAGTTGTAC GTCCTAACGA CTCAAAAGAA ATGAAAACAA TCCATGGTAC
Manfredo    ACAGTTGTAC GTCCTAACGA CTCAAAAGAA ATGAAAACAA TCCATGGGAC
MGAS8232    ACAGTTGTAC GTCCTAACGA CTCAAAAGAA ATGAAAACAA TCCATGGTAC

            201                                                250
ATCC700294  AACCCGTGCT AACTTGAATA ACATGGTTGT AGGTGTTTCT GAAGGTTTCA
MGAS315     AACCCGTGCT AACTTGAATA ACATGGTTGT AGGTGTTTCT GAAGGTTTCA
SSI-1       AACCCGTGCT AACTTGAATA ACATGGTTGT AGGTGTTTCT GAAGGTTTCA
Manfredo    AACCCGTGCT AACTTGAATA ACATGGTTGT AGGTGTTTCT GAAGGTTTCA
MGAS8232    AACCCGTGCT AACTTGAATA ACATGGTTGT AGGTGTTTCT GAAGGTTTCA



ATCC7 00294 

MGAS315 

SSI-1 

Manfredo 

MGAS8232 



251 

AAAAAGATCT 
AAAAAGATCT 
AAAAAGATCT 
AAAAAGATCT 
AAAAAGATCT 



TGAAATGAAG 
T GAAAT G AAG 
TGAAATGAAG 
TGAAATGAAG 
TGAAATGAAG 



GGTGTCGGTT 
GGTGTCGGTT 
GGTGTCGGTT 
GGTGTCGGTT 
GGTGTCGGTT 



ACCGTGCTCA 
ACCGTGCTCA 
ACCGTGCTCA 
ACCGCGCTCA 
ACCGTGCTCA 



300 

ACTTCAAGGT 
ACTTCAAGGT 
ACTTCAAGGT 
ACTTCAAGGT 
ACTTCAAGGT 



ATCC700294 

MGAS315 

SSI-1 

Manfredo 

MGAS8232 



301 

ACTAAACTTG 
ACTAAACTTG 
ACTAAACTTG 
ACTAAACTTG 
ACTAAACTTG 



TCCTTTCAGT 
TCCTTTCAGT 
TCCTTTCAGT 
TCCTTTCAGT 
TCCTTTCAGT 



AGGTAAATCT 
AGGTAAATCT 
AGGTAAATCT 
AGGTAAATCT 
AGGTAAATCT 



CACCAAGACG 
CACCAAGACG 
CACCAAGACG 
CACCAAGACG 
CACCAAGACG 



350 

AAGTTGAAGG 
AAGTTGAAGC 
AAGTTGAAGG 
AAGTTGAAGC 
AAGTTGAAGC 



ATCC700294 
MGAS315 
SSI-1 
Manfredo 
MGAS8232 - 



351 

TCCAGAAGGA 
TCCAGAAGGA 
TCCAGAAGGA 
TCCAGAAGGA 
TCCAGAAGGA 



ATTACTTTCA 
ATTACTTTCA 
ATTACTTTCA 
ATTACTTTCA 
ATTACTTTCA 



CTGTTGCTAA 
CTGTTGCTAA 
CTGTTGCTAA 
CTGTTGCTAA 
CTGTTGCTAA 



CCCAACTTCA 
CCCAACTTCA 
CCCAACTTCA 
CCCAACTTCA 
CCCAACTTCA 



400 

ATCTCAGTTG 
ATCTCAGTTG 
ATCTCAGTTG 
ATCTCAGTTG 
ATCTCAGTTG 

Figure 31 (continued)

            401                                                450
ATCC700294  AAGGAATCAA CAAAGAAGTT GTTGGTCAAA CAGCTGCTTA CATCCGTAGC
MGAS315     AAGGAATCAA CAAAGAAGTT GTTGGTCAAA CAGCTGCTTA CATCCGTAGC
SSI-1       AAGGAATCAA CAAAGAAGTT GTTGGTCAAA CAGCTGCTTA CATCCGTAGC
Manfredo    AAGGAATCAA CAAAGAAGTT GTTGGTCAAA CAGCTGCTTA CATCCGTAGC
MGAS8232    AAGGAATCAA CAAAGAAGTT GTTGGTCAAA CAGCTGCTTA CATCCGTAGC

            451                                                500
ATCC700294  TTGCGTTCAC CAGAGCCTTA CAAAGGCAAA GGGATCCGTT ACGTTGGTGA
MGAS315     TTGCGTTCAC CAGAGCCTTA CAAAGGCAAA GGGATCCGTT ACGTTGGTGA
SSI-1       TTGCGTTCAC CAGAGCCTTA CAAAGGCAAA GGGATCCGTT ACGTTGGTGA
Manfredo    TTGCGTTCAC CAGAGCCTTA CAAAGGCAAA GGGATCCGTT ACGTTGGTGA
MGAS8232    TTGCGTTCAC CAGAGCCTTA CAAAGGCAAA GGGATCCGTT ACGTTGGTGA

            501                                  537
ATCC700294  ATACGTACGC CTTAAAGAAG GTAAAACAGG TAAATAA
MGAS315     ATACGTACGC CTTAAAGAAG GTAAAACAGG TAAATAA
SSI-1       ATACGTACGC CTTAAAGAAG GTAAAACAGG TAAATAA
Manfredo    ATACGTACGC CTTAAAGAAG GTAAAACAGG TAAATAA
MGAS8232    ATACGTACGC CTTAAAGAAG GTAAAACAGG TAAATAA



Figure 32 

            1                                                   50
ATCC700294  MSRIGNKVIT MPAGVELTNN NNVITVKGPK GELTREFNKN IEIKVEGTEI
MGAS315     MSRIGNKVIT MPAGVELTNN NNVITVKGPK GELTREFNKN IEIKVEGTEI
SSI-1       MSRIGNKVIT MPAGVELTNN NNVITVKGPK GELTREFNKN IEIKVEGTEI
Manfredo    MSRIGNKVIT MPAGVELTNN NNVITVKGPK GELTREFNKN IEIKVEGTEI
MGAS8232    MSRIGNKVIT MPAGVELTNN NNVITVKGPK GELTREFNKN IEIKVEGTEI

            51                                                 100
ATCC700294  TVVRPNDSKE MKTIHGTTRA NLNNMVVGVS EGFKKDLEMK GVGYRAQLQG
MGAS315     TVVRPNDSKE MKTIHGTTRA NLNNMVVGVS EGFKKDLEMK GVGYRAQLQG
SSI-1       TVVRPNDSKE MKTIHGTTRA NLNNMVVGVS EGFKKDLEMK GVGYRAQLQG
Manfredo    TVVRPNDSKE MKTIHGTTRA NLNNMVVGVS EGFKKDLEMK GVGYRAQLQG
MGAS8232    TVVRPNDSKE MKTIHGTTRA NLNNMVVGVS EGFKKDLEMK GVGYRAQLQG

            101                                                150
ATCC700294  TKLVLSVGKS HQDEVEAPEG ITFTVANPTS ISVEGINKEV VGQTAAYIRS
MGAS315     TKLVLSVGKS HQDEVEAPEG ITFTVANPTS ISVEGINKEV VGQTAAYIRS
SSI-1       TKLVLSVGKS HQDEVEAPEG ITFTVANPTS ISVEGINKEV VGQTAAYIRS
Manfredo    TKLVLSVGKS HQDEVEAPEG ITFTVANPTS ISVEGINKEV VGQTAAYIRS
MGAS8232    TKLVLSVGKS HQDEVEAPEG ITFTVANPTS ISVEGINKEV VGQTAAYIRS

            151                        178
ATCC700294  LRSPEPYKGK GIRYVGEYVR LKEGKTGK
MGAS315     LRSPEPYKGK GIRYVGEYVR LKEGKTGK
SSI-1       LRSPEPYKGK GIRYVGEYVR LKEGKTGK
Manfredo    LRSPEPYKGK GIRYVGEYVR LKEGKTGK
MGAS8232    LRSPEPYKGK GIRYVGEYVR LKEGKTGK

Figure 33

            1                                                   50
ATCC700294  ATGTTTCAGT TAAGAAAAAA AATGACGCGC AAACAATTAG CCTTGTTGAG
MGAS315     ATGTTTCAGT TAAGAAAAAA AATGACGCGC AAACAATTAG CCTTGTTGAG
SSI-1       ATGTTTCAGT TAAGAAAAAA AATGACGCGC AAACAATTAG CCTTGTTGAG
Manfredo    ATGTTTCAGT TAAGAAAAAA AATGACGCGC AAACAATTAG CCTTGTTGAG
MGAS8232    ATGTTTCAGT TAAGAAAAAA AATGACGCGC AAACAATTAG CCTTGTTGAG

            51                                                 100
ATCC700294  TGCTGGAGTG TTGACCTGTG TGGTTGGTGG TAGCTACTTG ATAATGAACC
MGAS315     TGCTGGAGTG TTGACCTGTG TGGTTGGTGG TAGCTACTTG ATAATGAACC
SSI-1       TGCTGGAGTG TTGACCTGTG TGGTTGGTGG TAGCTACTTG ATAATGAACC
Manfredo    TGCTGGAGTG TTGACCTGTG TGGTTGGTGG TAGCTACTTG ATAATGAACC
MGAS8232    TGCTGGAGTG TTGACCTGTG TGGTTGGTGG TACCTACTTG ATAATGAATC

            101                                                150
ATCC700294  ATCAACAACA AGAAATTGTC TCTAGTGTCA ACAAAGTAAA AGCCTTAACC
MGAS315     ATCAACAACA AGAAATTGTC TCTAGTGTCA ACAAAGTAAA AGCCTTAACC
SSI-1       ATCAACAACA AGAAATTGTC TCTAGTGTCA ACAAAGTAAA AGCCTTAACC
Manfredo    ATCAACAACA AGAAGTTGTC TCTAGTGTCA ACAAAGTAAA AGCCTTAACC
MGAS8232    ATCAACAACA AGAAATTGTC TCTAGTGTCA ACAAAGTAAA AGCCTTAACC

            151                                                200
ATCC700294  ATAAAAGAAG CCATGGAACA AGGAAAAGAT ATCAGCTTGA CCTTAGCTGG
MGAS315     ATAAAAGAAG CCATGGAACA AGGAAAAGAT ATCAGCTTGA CCTTAGCTGG
SSI-1       ATAAAAGAAG CCATGGAACA AGGAAAAGAT ATCAGCTTGA CCTTAGCTGG
Manfredo    ATAAAAGAAG CCATGGAACA AGGAAAAGAT ATCAGCTTGA CCTTAGCTGG
MGAS8232    ATAAAAGAAG CCATGGAACA AGGAAAAGAT ATCAGCTTGA CCTTAGCTGG

            201                                                250
ATCC700294  CGAAGTAACA GCTAACAACA GCAGCAAAGT CAAAATCGAC TCAAGTAAAG
MGAS315     CGAAGTAACA GCTAACAACA GCAGCAAAGT CAAAATCGAC TCAAGTAAAG
SSI-1       CGAAGTAACA GCTAACAACA GCAGCAAAGT CAAAATCGAC TCAAGTAAAG
Manfredo    CGAAGTAACA GCTAACAACA GCAGCAAAGT CAAAATCGAC TCAAGTAAAG
MGAS8232    CGAAGTAACA GCTAACAACA GCAGCAAAGT CAAAATCGAC TCAAGTAAAG

            251                                                300
ATCC700294  GAGAAGTCAA AGAGGTCTTT GTTAAAAAAG GCGATGTTGT CAAAGTAGGA
MGAS315     GAGAAGTCAA AGAGGTCTTT GTTAAAAAAG GCGATGTTGT CAAAGTAGGA
SSI-1       GAGAAGTCAA AGAGGTCTTT GTTAAAAAAG GCGATGTTGT CAAAGTAGGA
Manfredo    GAGAAGTCAA AGAGGTCTTT GTCAAAAAAG GCGATGTTGT CAAAGTAGGA
MGAS8232    GAGAAGTCAA AGATGTCTTT GTCAAAAAAG GCGATGTTGT CAAAGTAGGA

            301                                                350
ATCC700294  CAACCCTTGT TTAGCTATGA AACGTCACAG CGGTTAACGG CTCAAAGTTC
MGAS315     CAACCCTTGT TTAGCTATGA AACGTCACAG CGGTTAACGG CTCAAAGTTC
SSI-1       CAACCCTTGT TTAGCTATGA AACGTCACAG CGGTTAACGG CTCAAAGTTC
Manfredo    CAACCCTTGT TTAGCTATGA AACGTCACAG CGGTTAACGG CTCAAAGTTC
MGAS8232    CAACCCTTGT TTAGCTATGA AACGTCACAA CGGTTAACGG CTCAAAGTTC

            351                                                400
ATCC700294  AGAATTTGAT GTTCAAACCA AAGCCAATCA GCTCCAAGTT GCTAAAACCA
MGAS315     AGAATTTGAT GTTCAAACCA AAGCCAATCA GCTCCAAGTT GCTAAAACCA
SSI-1       AGAATTTGAT GTTCAAACCA AAGCCAATCA GCTCCAAGTT GCTAAAACCA
Manfredo    AGAATTTGAT GTTCAAACCA AAGCCAATCA ACTCCAAGTT GCTAAAACCA
MGAS8232    AGAATTTGAT GTTCAAACCA AAGCCAATCA ACTCCAAGTT GCTAAAACCA

Figure 33 (continued)

            401                                                450
ATCC700294  ATGCAGCATT GAAGTGGGAA ACCTACAATC GCAAGGTCAA TGAAATCAAC
MGAS315     ATGCAGCATT GAAGTGGGAA ACCTACAATC GCAAGGTCAA TGAAATCAAC
SSI-1       ATGCAGCATT GAAGTGGGAA ACCTACAATC GCAAGGTCAA TGAAATCAAC
Manfredo    ATGCAGCATT GAAGTGGGAA ACCTACAATC GCAAGGTCAA TGAAATCAAC
MGAS8232    ATGCAGCATT GAAGTGGGAA ACCTACAATC GCAAGGTCAA TGAAATCAAT

            451                                                500
ATCC700294  ACCCTAAAAT CTCGCTACAA CACTGCACCA GATGAGAGCT TACTAGAGCA
MGAS315     ACCCTAAAAT CTCGCTACAA CACTGCACCA GATGAGAGCT TACTAGAGCA
SSI-1       ACCCTAAAAT CTCGCTACAA CACTGCACCA GATGAGAGCT TACTAGAGCA
Manfredo    ACCCTAAAAT CTCGCTACAA CACTGCACCA GATGAGAGCT TACTAGAGCA
MGAS8232    ACCCTAAAAT CTCGCTACAA CACTGCACCA GATGAGAGCT TACTAGAGCA

            501                                                550
ATCC700294  AATTCGCAGC GCAGAAGACA GTGTATCCCA AGCACTAAGC GATGCCAAAA
MGAS315     AATTCGCAGC GCAGAAGACA GTGTATCCCA AGCACTAAGC GATGCCAAAA
SSI-1       AATTCGCAGC GCAGAAGACA GTGTATCCCA AGCACTAAGC GATGCCAAAA
Manfredo    AATTCGCAGC GCAGAAGACA GTGTATCCCA AGCACTAAGC GATGCCAAAA
MGAS8232    AATTCGCAGC GCAGAAGACA GTGTATCTCA AGCACTAAGC GATGCCAAAA

            551                                                600
ATCC700294  CAGCAGATAG CGATGTCAAA ACCGCTCAAA TCGAACTCGA TAAAGCTAAT
MGAS315     CAGCAGATAG CGATGTCAAA ACCGCTCAAA TCGAACTCGA TAAAGCTAAT
SSI-1       CAGCAGATAG CGATGTCAAA ACCGCTCAAA TCGAACTCGA TAAAGCTAAT
Manfredo    CAGCAGATAG CGATGTCAAA ACCGCTCAAA TCGAACTCGA TAAAGCTAAT
MGAS8232    CAGCAGATAG CGATGTCAAA ACCGCTCAAA TCGAACTCGA TAAAGCTAAT

            601                                                650
ATCC700294  GCTACTGCCA CAACGGAAAA AGGTAAACTA GAGTATGACA CCGTTAAGTC
MGAS315     GCTACTGCCA CAACGGAAAA AGGTAAACTA GAGTATGACA CCGTTAAGTC
SSI-1       GCTACTGCCA CAACGGAAAA AGGTAAACTA GAGTATGACA CCGTTAAGTC
Manfredo    GCTACTGCCA CAATGGAAAA AGGTAAACTA GAGTATGACA CCGTTAAGTC
MGAS8232    GCTACTGCCG CAACGGAAAA AGGTAAACTA GAGTATGACA CCGTTAAGTC

            651                                                700
ATCC700294  AGACACCGCA GGAACCATTG TTAGTCTAAA TACTGATTTG CCAAATCAAT
MGAS315     AGACACCGCA GGAACCATTG TTAGTCTAAA TACTGATTTG CCAAATCAAT
SSI-1       AGACACCGCA GGAACCATTG TTAGTCTAAA TACTGATTTG CCAAATCAAT
Manfredo    AGACACCGCA GGAACCATTG TTAGCCTAAA TACTGATTTG CCAAATCAAT
MGAS8232    AGACACCGCA GGAACCATTG TTAGTCTAAA TACTGATTTG CCAAATCAAT

            701                                                750
ATCC700294  CAAAATCCAA AAAAGAAAAT GAAACTTTTA TGGAAATTAT CGACAAATCA
MGAS315     CAAAATCCAA AAAAGAAAAT GAAACTTTTA TGGAAATTAT CGACAAATCA
SSI-1       CAAAATCCAA AAAAGAAAAT GAAACTTTTA TGGAAATTAT CGACAAATCA
Manfredo    CAAAATCCAA AAAAGAAAAT GAAACTTTTA TGGAAATTAT CGACAAATCA
MGAS8232    CAAAATCCAA AAAAGAAAAT GAAACTTTTA TGGAAATTAT CGACAAATCA

            751                                                800
ATCC700294  AAAATGTTAG TCAAAGGTAA CATTAGTGAA TTTGACCGTG ACAAGTTAAA
MGAS315     AAAATGTTAG TCAAAGGTAA CATTAGTGAA TTTGACCGTG ACAAGTTAAA
SSI-1       AAAATGTTAG TCAAAGGTAA CATTAGTGAA TTTGACCGTG ACAAGTTAAA
Manfredo    AAAATGTTAG TCAAAGGTAA CATCAGTGAA TTTGACCGTG ACAAGTTAAA
MGAS8232    AAAATGTTAG TCAAAGGTAA CATCAGTGAA TTTGACCGTG ACAAGTTAAA

Figure 33 (continued)

            801                                                850
ATCC700294  AATCGGTCAA AAAGTCGAAG TGATTGACCG CAAAGACAAC TCTAAAAAAT
MGAS315     AATCGGTCAA AAAGTCGAAG TGATTGACCG CAAAGACAAC TCTAAAAAAT
SSI-1       AATCGGTCAA AAAGTCGAAG TGATTGACCG CAAAGACAAC TCTAAAAAAT
Manfredo    AATCGATCAA AAAGTCGAAG TGATTGACCG CAAAGACAAC TCTAAAAAAT
MGAS8232    AATCGATCAA AAAGTCGAAG TGATTGACCG CAAAGACAAC TCTAAAAAAT

            851                                                900
ATCC700294  GGACTGGAAA AGTAACCCAA GTTGGCAACC TCAAAGCAGA GGAAAAAGGC
MGAS315     GGACTGGAAA AGTAACCCAA GTTGGCAACC TCAAAGCAGA GGAAAAAGGC
SSI-1       GGACTGGAAA AGTAACCCAA GTTGGCAACC TCAAAGCAGA GGAAAAAGGC
Manfredo    GGACTGGAAA AGTAACCCAA GTTGGCAACC TCAAAGCAGA GGAAAAAGGC
MGAS8232    GGACTGGAAA AGTAACCCAA GTTGGCAACC TCAAAGCAGA GGAAAAAGGC

            901                                                950
ATCC700294  CAAGGTCAAG GCCAAGGTGG CAATGACCAA CAAGATAATC CAAACCAAGC
MGAS315     CAAGGTCAAG GCCAAGGTGG CAATGACCAA CAAGATAATC CAAACCAAGC
SSI-1       CAAGGTCAAG GCCAAGGTGG CAATGACCAA CAAGATAATC CAAACCAAGC
Manfredo    CAAGGTCAAG GCCAAGGTGG CAATGACCAA CAAGACAATC CAAACCAAGC
MGAS8232    CAAGGTCAAG GCCAAGGTGG CAATGACCAA CAAGATAATC CAAACCAAGC

            951                                               1000
ATCC700294  AAAATTCCCT TATGTTATTG AACTTGACCA ATCAGACAAG CAGCCACTCA
MGAS315     AAAATTCCCT TATGTTATTG AACTTGACCA ATCAGACAAG CAGCCACTCA
SSI-1       AAAATTCCCT TATGTTATTG AACTTGACCA ATCAGACAAG CAGCCACTCA
Manfredo    AAAATTCCCT TATGTTATCG AACTTGACCA ATCAGACAAG CAGCCACTCA
MGAS8232    AAAATTCCCT TATGTTATCG AACTTGACCA ATCAGACAAG CAGCCACTCA

            1001                                              1050
ATCC700294  TTGGCTCACA CACCTATGTT AATGTGCTCA ACAATGTTCC AGAAGCTGGC
MGAS315     TTGGCTCACA CACCTATGTT AATGTACTCA ACAATGTTCC AGAAGCTGGC
SSI-1       TTGGCTCACA CACCTATGTT AATGTACTCA ACAATGTTCC AGAAGCTGGC
Manfredo    TTGGCTCACA CACCTATGTT AATGTGCTCA ACAATGTTCC AGAAGCTGGC
MGAS8232    TTGGTTCACA CACCTATGTT AATGTGCTCA ACAATGTTCC AGAAGCTGGC

            1051                                              1100
ATCC700294  AAGATCGTAT TGAAAGAAAC CTTTACAATG GCAGAAAATG GAAAAACCTA
MGAS315     AAGATCGTAT TGAAAGAAAC CTTTACAATG GCAGAAAATG GAAAAACCTA
SSI-1       AAGATCGTAT TGAAAGAAAC CTTTACAATG GCAGAAAATG GAAAAACCTA
Manfredo    AAGATCGTAT TGAAAGAAAC CTTTACAATG GCAGAAAATG GAAAAACCTA
MGAS8232    AAGATCGTAT TGAAAGAAAC CTTTACAATG GCAGAAAATG GAAAAACCTA

            1101                                              1150
ATCC700294  TGTGTGGAAA GTTGATAAAA ACAAGGTCAA AAAACAAGAA ATCAAGACTA
MGAS315     TGTGTGGAAA GTTGATAAAA ACAAGGTCAA AAAACAAGAA ATCAAGACTA
SSI-1       TGTGTGGAAA GTTGATAAAA ACAAGGTCAA AAAACAAGAA ATCAAGACTA
Manfredo    TGTGTGGAAA GTTGATAAAA ACAAGGTCAA AAAACAAGAA ATCAAGACTA
MGAS8232    TGTGTGGAAA GTTGATAAAA ACAAGGTCAA AAAACAAGAA ATCAAGACTA

            1151                                              1200
ATCC700294  AGCCCTTCTC AAAAGGTTAT GTTGAGGTAA CAAGTGGCTT GACTATGCAA
MGAS315     AGCCCTTCTC AAAAGGTTAT GTTGAGGTAA CAAGTGGCTT GACTATGCAA
SSI-1       AGCCCTTCTC AAAAGGTTAT GTTGAGGTAA CAAGTGGCTT GACTATGCAA
Manfredo    AGCCCTTCTC AAAAGGTTAT GTTGAGGTGA CAAGTGGCTT GACTATGCAA
MGAS8232    AGCCCTTCTC AAAAGGTTAT GTTGAGGTGA CAAGCGGCTT GACTATGCAA

Figure 33 (continued)

            1201                                              1250
ATCC700294  GATAAGATTG CTCAGCCGCT TCCTGGCATG AAAGACGGTA TGGAGGTAGG
MGAS315     GATAAGATTG CTCAGCCGCT TCCTGGCATG AAAGACGGTA TGGAGGTAGG
SSI-1       GATAAGATTG CTCAGCCGCT TCCTGGCATG AAAGACGGTA TGGAGGTAGG
Manfredo    GATAAGATTG CTCAGCCGCT TCCTGGCATG AAAGACGGTA TGGAGGTAGG
MGAS8232    GATAAGATTG CTCAGCCGCT TCCTGGCATG AAAGACGGTA TGGAGGTAGG

            1251            1269
ATCC700294  AAGTATTGTT AAACCTTAA
MGAS315     AAGTATTGTT AAACCTTAA
SSI-1       AAGTATTGTT AAACCTTAA
Manfredo    AAGTATTGTT AAACCTTAA
MGAS8232    AAGTATTGTT AAACCTTAA



Figure 34 

            1                                                   50
ATCC700294  MFQLRKKMTR KQLALLSAGV LTCVVGGSYL IMNHQQQEIV SSVNKVKALT
MGAS315     MFQLRKKMTR KQLALLSAGV LTCVVGGSYL IMNHQQQEIV SSVNKVKALT
SSI-1       MFQLRKKMTR KQLALLSAGV LTCVVGGSYL IMNHQQQEIV SSVNKVKALT
Manfredo    MFQLRKKMTR KQLALLSAGV LTCVVGGSYL IMNHQQQEVV SSVNKVKALT
MGAS8232    MFQLRKKMTR KQLALLSAGV LTCVVGGTYL IMNHQQQEIV SSVNKVKALT

            51                                                 100
ATCC700294  IKEAMEQGKD ISLTLAGEVT ANNSSKVKID SSKGEVKEVF VKKGDVVKVG
MGAS315     IKEAMEQGKD ISLTLAGEVT ANNSSKVKID SSKGEVKEVF VKKGDVVKVG
SSI-1       IKEAMEQGKD ISLTLAGEVT ANNSSKVKID SSKGEVKEVF VKKGDVVKVG
Manfredo    IKEAMEQGKD ISLTLAGEVT ANNSSKVKID SSKGEVKEVF VKKGDVVKVG
MGAS8232    IKEAMEQGKD ISLTLAGEVT ANNSSKVKID SSKGEVKDVF VKKGDVVKVG

            101                                                150
ATCC700294  QPLFSYETSQ RLTAQSSEFD VQTKANQLQV AKTNAALKWE TYNRKVNEIN
MGAS315     QPLFSYETSQ RLTAQSSEFD VQTKANQLQV AKTNAALKWE TYNRKVNEIN
SSI-1       QPLFSYETSQ RLTAQSSEFD VQTKANQLQV AKTNAALKWE TYNRKVNEIN
Manfredo    QPLFSYETSQ RLTAQSSEFD VQTKANQLQV AKTNAALKWE TYNRKVNEIN
MGAS8232    QPLFSYETSQ RLTAQSSEFD VQTKANQLQV AKTNAALKWE TYNRKVNEIN

            151                                                200
ATCC700294  TLKSRYNTAP DESLLEQIRS AEDSVSQALS DAKTADSDVK TAQIELDKAN
MGAS315     TLKSRYNTAP DESLLEQIRS AEDSVSQALS DAKTADSDVK TAQIELDKAN
SSI-1       TLKSRYNTAP DESLLEQIRS AEDSVSQALS DAKTADSDVK TAQIELDKAN
Manfredo    TLKSRYNTAP DESLLEQIRS AEDSVSQALS DAKTADSDVK TAQIELDKAN
MGAS8232    TLKSRYNTAP DESLLEQIRS AEDSVSQALS DAKTADSDVK TAQIELDKAN

            201                                                250
ATCC700294  ATATTEKGKL EYDTVKSDTA GTIVSLNTDL PNQSKSKKEN ETFMEIIDKS
MGAS315     ATATTEKGKL EYDTVKSDTA GTIVSLNTDL PNQSKSKKEN ETFMEIIDKS
SSI-1       ATATTEKGKL EYDTVKSDTA GTIVSLNTDL PNQSKSKKEN ETFMEIIDKS
Manfredo    ATATMEKGKL EYDTVKSDTA GTIVSLNTDL PNQSKSKKEN ETFMEIIDKS
MGAS8232    ATAATEKGKL EYDTVKSDTA GTIVSLNTDL PNQSKSKKEN ETFMEIIDKS

            251                                                300
ATCC700294  KMLVKGNISE FDRDKLKIGQ KVEVIDRKDN SKKWTGKVTQ VGNLKAEEKG
MGAS315     KMLVKGNISE FDRDKLKPGQ KVEVIDRKDN SKKWTGKVTQ VGNLKAEEKG
SSI-1       KMLVKGNISE FDRDKLKIGQ KVEVIDRKDN SKKWTGKVTQ VGNLKAEEKG
Manfredo    KMLVKGNISE FDRDKLKIDQ KVEVIDRKDN SKKWTGKVTQ VGNLKAEEKG
MGAS8232    KMLVKGNISE FDRDKLKIDQ KVEVIDRKDN SKKWTGKVTQ VGNLKAEEKG

Figure 34 (continued)

            301                                                350
ATCC700294  QGQGQGGNDQ QDNPNQAKFP YVIELDQSDK QPLIGSHTYV NVLNNVPEAG
MGAS315     QGQGQGGNDQ QDNPNQAKFP YVIELDQSDK QPLIGSHTYV NVLNNVPEAG
SSI-1       QGQGQGGNDQ QDNPNQAKFP YVIELDQSDK QPLIGSHTYV NVLNNVPEAG
Manfredo    QGQGQGGNDQ QDNPNQAKFP YVIELDQSDK QPLIGSHTYV NVLNNVPEAG
MGAS8232    QGQGQGGNDQ QDNPNQAKFP YVIELDQSDK QPLIGSHTYV NVLNNVPEAG

            351                                                400
ATCC700294  KIVLKETFTM AENGKTYVWK VDKNKVKKQE IKTKPFSKGY VEVTSGLTMQ
MGAS315     KIVLKETFTM AENGKTYVWK VDKNKVKKQE IKTKPFSKGY VEVTSGLTMQ
SSI-1       KIVLKETFTM AENGKTYVWK VDKNKVKKQE IKTKPFSKGY VEVTSGLTMQ
Manfredo    KIVLKETFTM AENGKTYVWK VDKNKVKKQE IKTKPFSKGY VEVTSGLTMQ
MGAS8232    KIVLKETFTM AENGKTYVWK VDKNKVKKQE IKTKPFSKGY VEVTSGLTMQ

            401                  422
ATCC700294  DKIAQPLPGM KDGMEVGSIV KP
MGAS315     DKIAQPLPGM KDGMEVGSIV KP
SSI-1       DKIAQPLPGM KDGMEVGSIV KP
Manfredo    DKIAQPLPGM KDGMEVGSIV KP
MGAS8232    DKIAQPLPGM KDGMEVGSIV KP



Figure 35 

            1                                                   50
ATCC700294  ATGATAAAAC GATGTAAAGG AATTGGTCTA GCCTTAATGG CCTTCTTTTT
MGAS315     ATGATAAAAC GATGTAAAGG AATTGGTCTA GTCTTAATGG CCTTCTTTTT
SSI-1       ATGATAAAAC GATGTAAAGG AATTGGTCTA GTCTTAATGG CCTTCTTTTT
Manfredo    ATGATAAAAC GATGTAAAGG AATTGGTCTA GCCTTAATGG CCTTCTTTTT
MGAS8232    ATGATAAAAC GATGTAAAGG AATTGGTCTA GTCTTAATGG CCTTCTTTTT

            51                                                 100
ATCC700294  GGTAGCTTGT GTGAATCAGC ACCCTAAAAC GGCTAAAGAG ACTGAACAGC
MGAS315     GGTAGCTTGT GTAAATCAGC ACCCTAAAAC GGCTAAAGAG ACTGAACAGC
SSI-1       GGTAGCTTGT GTAAATCAGC ACCCTAAAAC GGCTAAAGAG ACTGAACAGC
Manfredo    GGTAGCTTGT GTGAATCAGC ACCCTAAAAC GGCTAAAGAG ACTGAACAGC
MGAS8232    GGTAGCTTGT GTAAATCAGC ACCCTAAAAC GGCTAAAGAG ACTGAACAGC

            101                                                150
ATCC700294  AGAGAATTGT AGCCACTTCG GTTGCTGTGG TTGATATCTG TGACCGTTTA
MGAS315     AGAGAATTGT AGCCACTTCG GTTGCTGTGG TTGATATCTG TGACCGTTTA
SSI-1       AGAGAATTGT AGCCACTTCG GTTGCTGTGG TTGATATCTG TGACCGTTTA
Manfredo    AGAGAATTGT AGCCACTTCG GTTGCTGTGG TTGATATCTG TGACCGTTTA
MGAS8232    AGAGAATTGT AGCCACTTCG GTTGCTGTGG TTGATATCTG TGACCGTTTA

            151                                                200
ATCC700294  AATTTAGACC TCGTTGGGGT TTGTGATAGT AAATTATATA CCCTTCCTAA
MGAS315     AATTTAGACC TCGTTGGGGT TTGTGATAGT AAATTATATA CCCTTCCTAA
SSI-1       AATTTAGACC TCGTTGGGGT TTGTGATAGT AAATTATATA CCCTTCCTAA
Manfredo    AATTTAGACC TCGTTGGGGT TTGTGATAGT AAATTATATA CCCTTCCTAA
MGAS8232    AATTTAGACC TCGTTGGGGT TTGTGATAGT AAATTATATA CCCTTCCTAA

            201                                                250
ATCC700294  ACGCTATGAT GCTGTTAAGC GTGTGGGTTT ACCCATGAAT CCTGATATAG
MGAS315     ACGCTATGAT GCTGTTAAGC GTGTGGGTTT ACCCATGAAT CCTGATATAG
SSI-1       ACGCTATGAT GCTGTTAAGC GTGTGGGTTT ACCCATGAAT CCTGATATAG
Manfredo    ACGCTATGAT GCTGTTAAGC GTGTGGGTTT ACCCATGAAT CCTGATATAG
MGAS8232    ACGCTATGAT GCTGTTAAGC GTGTGGGTTT ACCCATGAAT CCTGATATAG
Figure 35 (continued) 



ATCC700294 

MGAS315 

SSI-1 

Manfredo 

MGAS8232 



ATCC700294 

MGAS315 

SSI-1 

Manfredo 

MGAS8232 



ATCC700294 

MGAS315 

SSI-1 

Manfredo 

MGAS8232 



ATCC700294 

MGAS315 

SSI-1 

Manfredo 

MGAS8232 



ATCC700294 
MGAS315 
SSI-1 
Manfredo 
MGAS8232 



651 

GGCTAAGGAG 
AGCTAAGGAG 
AGCTAAGGAG 
GGCTAAGGAG 
AGCTAAGGAG 

701 

AGGTAAAAGT 
AGGTAAAAGT 
AGGTAAAAGT 
AGGTAAAAGT 
AGGTAAAAGT 

751 

CATTTTACGG 
CATTTTACGG 
CATTTTACGG 
CATTTTACGG 
CATTTTACGG 

801 

GTTTGGCATG 
GTTTGGCATG 
GTTTGGCATG 
GTTTGGCATG 
GTTTGGCATG 

851 

CACAGCTTTT 
CACAGCTTTT 
CACAGCTTTT 
CACAGCTTTT 
CACAGCTTTT 



CCTGACTTGA 
CCTGACTTGA 
CCTGACTTGA 
CCTGATTTGA 
CCTGACTTGA 



GATGTTTGAC 
GATGTTTGAC 
GATGTTTGAC 
GATGTTTGAC 
GATGTTTGAC 



CAGTCAAGGA 
CAGTCAAGGA 
CAGTCAAGGA 
CAGTCAAGGA 
CAGTCAAGGA 



AGTGCTAAAT 
AGTGCTAAAT 
AGTGCTAAAT 
AGTGCTAAAT 
AGTGCTAAAT 



TGACCACGTG 
TGACCACGTG 
TGACCACGTG 
TGACCGCGTG 
TGACCACGTG 



TTTTACGAAC 
TTTTACGAAC 
TTTTACGAAC 
TTTTACGAAC 
TTTTACGAAC 



AAAGAATTTG 
AAAGAATTTG 
AAAGAATTTG 
AAAGAATTTG 
AAAGAATTTG 